ERIC - Search Results

Showing 286 to 300 of 416 results

Livingston's Reliability Coefficient and Harris' Index of Efficiency: An Empirical Study of the Two Reliability Coefficients For Criterion-Referenced Tests.

Rim, Eui-Do; Bresler, Samuel – 1974

Livingston's reliability coefficients and Harris' indices of efficiency were computed along with the classical internal consistency coefficients, KR-20's (Kuder-Richardson internal consistency coefficient), for 678 criterion-referenced tests in the A through E levels of an individualized mathematics program. The coefficients were carefully studied…

Descriptors: Academic Achievement, Correlation, Criterion Referenced Tests, Elementary School Mathematics

Correcting Four Similar Correlational Measures for Attenuation Due to Errors of Measurement in the Dependent Variable: Eta, Epsilon, Omega, and Intraclass r.

Stanley, Julian C.; Livingston, Samuel A. – 1971

Besides the ubiquitous Pearson product-moment r, there are a number of other measures of relationship that are attenuated by errors of measurement and for which the relationship between true measures can be estimated. Among these are the correlation ratio (eta squared), Kelley's unbiased correlation ratio (epsilon squared), Hays' omega squared,…

Descriptors: Analysis of Variance, Cluster Grouping, Correlation, Data Analysis

An Empirical Investigation of Four Criterion-Referenced Testing Models.

Epstein, Kenneth I. – 1975

Since the primary purpose of classical testing is to rank order examinees consistently, the absolute value of the true score has been relatively unimportant. However, the major purpose of criterion referenced testing is to estimate the true capabilities of examinees to perform specific tasks. Hence, the problems of true score determination assume…

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mathematical Models, Military Personnel

The Effects of Dimensionality on True Score Conversion Tables for the Law School Admission Test. LSAC Research Report Series.

Camilli, Gregory; Wang, Ming-mei; Fesq, Jaqueline – 1992

The Law School Admission Test (LSAT) was examined to see if the items on a form could be divided into different subgroups in which items looked statistically similar within the subgroups but statistically different between subgroups. If such subgrouping can be detected, it is likely that the subgroups of items measure different abilities, and the…

Descriptors: Admission (School), College Entrance Examinations, Factor Analysis, Item Response Theory

Integration of Concepts of Reliability and Standard Error of Measurement

Peer reviewed

Horn, John L. – Educational and Psychological Measurement, 1971

Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Mathematical Models

Statistical Analysis of Sets of Congeneric Tests

Peer reviewed

Joreskog, K. G. – Psychometrika, 1971

Descriptors: Correlation, Factor Analysis, Goodness of Fit, Mathematical Models

Using Longitudinal Data to Estimate Reliability in the Presence of Correlated Measurement Errors.

Peer reviewed

Werts, C. E.; And Others – Educational and Psychological Measurement, 1980

Test-retest correlations can lead to biased reliability estimates when there is instability of true scores and/or when measurement errors are correlated. Using three administrations of the Test of Standard Written English and essay ratings, an analysis is demonstrated which separates true score instability and correlated errors. (Author/BW)

Descriptors: College Freshmen, Error of Measurement, Essay Tests, Higher Education

Derivations of Observed Score Equating Methods that Cater to Populations Differing in Ability.

Peer reviewed

MacCann, Robert G. – Journal of Educational Statistics, 1990

For anchor test equating, 3 linear observed score methods are derived for populations differing in ability. Each version requires that the correlations of the tests with the selection variable be known. Five sets of assumptions are made for each model--yielding 15 methods--which are then related to existing methods. (SLD)

Descriptors: Ability, Ability Grouping, Equated Scores, Equations (Mathematics)

Automated Essay Scoring versus Human Scoring: A Comparative Study

Peer reviewed

Wang, Jinhao; Brown, Michelle Stallone – Journal of Technology, Learning, and Assessment, 2007

The current research was conducted to investigate the validity of automated essay scoring (AES) by comparing group mean scores assigned by an AES tool, IntelliMetric [TM] and human raters. Data collection included administering the Texas version of the WriterPlacer "Plus" test and obtaining scores assigned by IntelliMetric [TM] and by…

Descriptors: Test Scoring Machines, Scoring, Comparative Testing, Intermode Differences

Estimating the Reliability of Classifications Based on Composite Scores.

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

An Approach to Biased Item Identification Using Latent Trait Measurement Theory.

Rudner, Lawrence M. – 1977

Because it is a true score model employing item parameters which are independent of the examined sample, item characteristic curve theory (ICC) offers several advantages over classical measurement theory. In this paper an approach to biased item identification using ICC theory is described and applied. The ICC theory approach is attractive in that…

Descriptors: Bias, Criteria, Culture Fair Tests, Item Analysis

Statistical Comparisons Among Hierarchies Based on Latent Structure Models. Research Monograph 77-1.

Macready, George B.; Dayton, C. Mitchell – 1977

A probabilistic hypothesis testing procedure to assess the fit of hypothesized hierarchical structures for test item data is discussed. Statistical procedures are presented which are useful for evaluating the fit of data of a certain class of probabilistic models. These models apply to sets of dichotomous (0,1) responses for which there are…

Descriptors: Error of Measurement, Goodness of Fit, Hypothesis Testing, Mathematical Models

Bayesian and Empirical Bayes Approaches to Setting Passing Scores on Mastery Tests. Publication Series in Mastery Testing.

Huynh, Huynh; Saunders, Joseph C., III – 1979

The Bayesian approach to setting passing scores, as proposed by Swaminathan, Hambleton, and Algina, is compared with the empirical Bayes approach to the same problem that is derived from Huynh's decision-theoretic framework. Comparisons are based on simulated data which follow an approximate beta-binomial distribution and on real test results from…

Descriptors: Bayesian Statistics, Cutting Scores, Grade 3, Mastery Tests

An Interpretation of Livingston's Reliability Coefficient for Criterion-Referenced Tests.

Harris, Chester W. – 1971

Livingston's work is a careful analysis of what occurs when one pools two populations with different means, but similar variances and reliability coefficients. However, his work fails to advance reliability theory for the special case of criterion-referenced testing. See ED 042 802 for Livingston's paper. (MS)

Descriptors: Analysis of Variance, Criterion Referenced Tests, Error of Measurement, Reliability

Intraclass Reliability Estimates: Testing Structural Assumptions.

Werts, C. E.; And Others – 1972

Intraclass correlation reliability estimates are based on the assumption that the various measures are equivalent. Joreskog's (1970) general model for the analysis of covariance structures can be used to test the validity of this assumption. (For related document, see TM 002 301.) (Author)

Descriptors: Analysis of Covariance, Correlation, Hypothesis Testing, Mathematical Models
