ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Descriptor

Mathematical Models	59
Statistical Analysis	59
Test Reliability	59
Error of Measurement	18
Measurement Techniques	16
Test Validity	14
Item Analysis	13
True Scores	13
Comparative Analysis	12
Criterion Referenced Tests	12
Correlation	11
Test Construction	11
Analysis of Variance	10
Test Items	10
Factor Analysis	9
Test Interpretation	9
Equated Scores	8
Sampling	8
Scores	8
Test Theory	8
Evaluation Methods	7
Goodness of Fit	7
Latent Trait Theory	7
Mastery Tests	7
Norm Referenced Tests	7
More ▼

Source

Educational and Psychological…	7
Journal of Educational…	4
Psychometrika	4
Applied Psychological…	1
Journal of Educational…	1

Publication Type

Reports - Research	35
Speeches/Meeting Papers	9
Journal Articles	7
Reports - Evaluative	3
Numerical/Quantitative Data	2
Reference Materials -…	2
Reports - Descriptive	2
Guides - General	1
Guides - Non-Classroom	1
Opinion Papers	1

Education Level

Audience

Researchers

Location

California	1
South Carolina	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Armed Services Vocational…	1
Comprehensive Tests of Basic…	1
SAT (College Admission Test)	1
Stanford Binet Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 59 results Save | Export

A Measurement Is a Choice and Stevens' Scales of Measurement Do Not Help Make It: A Response to Chalmers

Peer reviewed

Direct link

Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019

Chalmers recently published a critique of the use of ordinal a[alpha] proposed in Zumbo et al. as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…

Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models

AMOVA ["Accumulative Manifold Validation Analysis"]: An Advanced Statistical Methodology Designed to Measure and Test the Validity, Reliability, and Overall Efficacy of Inquiry-Based Psychometric Instruments

Peer reviewed
PDF on ERIC

Download full text

Osler, James Edward, II – Journal of Educational Technology, 2015

This monograph provides an epistemological rational for the Accumulative Manifold Validation Analysis [also referred by the acronym "AMOVA"] statistical methodology designed to test psychometric instruments. This form of inquiry is a form of mathematical optimization in the discipline of linear stochastic modelling. AMOVA is an in-depth…

Descriptors: Statistical Analysis, Test Validity, Test Reliability, Inquiry

The Kappamax Reliability Index for Decisions in Domain-Referenced Testing.

Download full text

Huynh, Huynh – 1977

The kappamax reliability index of domain-referenced tests is defined as the upper bound of kappa when all possibile cutoff scores are considered. Computational procedures for kappamax are described, as well as its approximation for long tests, based on Kuder-Richardson formula 21. The sampling error of kappamax, and the effects of test length and…

Descriptors: Criterion Referenced Tests, Mathematical Models, Statistical Analysis, Test Reliability

Test Length and the Standard Error of Measurement

Peer reviewed

Gardner, P. L. – Journal of Educational Measurement, 1970

Descriptors: Error of Measurement, Mathematical Models, Statistical Analysis, Test Reliability

A Note on Huynh's Normal Approximation Procedure for Estimating Criterion-Referenced Reliability.

Peer reviewed

Peng, Chao-Ying, J.; Subkoviak, Michael J. – Journal of Educational Measurement, 1980

Huynh (1976) suggested a method of approximating the reliability coefficient of a mastery test. The present study examines the accuracy of Huynh's approximation and also describes a computationally simpler approximation which appears to be generally more accurate than the former. (Author/RL)

Descriptors: Error of Measurement, Mastery Tests, Mathematical Models, Statistical Analysis

Estimating Reliability from a Single Administration of a Mastery Test.

Download full text

Subkoviak, Michael J. – 1976

A number of different definitions and indices of reliability for mastery tests have recently been proposed in an attempt to cope with possible lack of score variability that attenuates traditional coefficients. One promising index that has been suggested is the proportion of students in a group that are consistently assigned to the same mastery…

Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Scores

A General Method of Estimating the Reliability of a Composite.

Peer reviewed

Werts, C. E.; And Others – Educational and Psychological Measurement, 1978

A procedure for estimating the reliability of a factorially complex composite is considered. An application of its use with Scholastic Aptitude Test data is provided. (Author/JKS)

Descriptors: Correlation, Factor Analysis, Mathematical Models, Matrices

A Theoretical Study of Two-Stage Testing

Peer reviewed

Lord, Frederic M. – Psychometrika, 1971

A two-stage testing procedure, a routing test followed by one of several alternative second-stage tests, is studied in the situation where the purpose is measurement, not classification. Models are developed, examined, and compared with conventional tests and up-and-down procedures. (DG)

Descriptors: Guessing (Tests), Mathematical Models, Measurement Techniques, Scoring

Estimation of the KR20 Reliability Coefficient When Data Are Incomplete.

Download full text

Huynh, Huynh – 1977

Three techniques for estimating Kuder Richardson reliability (KR20) coefficients for incomplete data are contrasted. The methods are: (1) Henderson's Method 1 (analysis of variance, or ANOVA); (2) Henderson's Method 3 (FITCO); and (3) Koch's method of symmetric sums (SYSUM). A Monte Carlo simulation was used to assess the precision of the three…

Descriptors: Analysis of Variance, Comparative Analysis, Mathematical Models, Monte Carlo Methods

Variance-Stabilizing Transformation of the Stepped-Up Reliability Coefficient.

Download full text

Lord, Frederic M. – 1972

The stepped-up reliability coefficient does not have the same standard error as an ordinary correlation coefficient. Fisher's Z -transformation should not be applied to it. Appropriate procedures are suggested. (Author)

Descriptors: Analysis of Variance, Mathematical Models, Research, Research Reports

A Comparison of Three Methods of Analyzing Dichotomous Data in a Randomized Block Design.

Download full text

Mandeville, Garrett K.

Results of a comparative study of F and Q tests, in a randomized block design with one replication per cell, are presented. In addition to these two procedures, a multivariate test was also considered. The model and test statistics, data generation and parameter selection, results, summary and conclusions are presented. Ten tables contain the…

Descriptors: Comparative Analysis, Data Analysis, Mathematical Models, Models

Test Theory with Minimal Assumptions

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1976

Using the concepts of conditional probability, conditional expectation, and conditional independence, the main results of the classical test theory model can be derived in a very few steps with minimal assumptions. The present effort explores the possibility that present classical test theories can be further condensed. (Author/RC)

Descriptors: Career Development, Correlation, Mathematical Models, Measurement

Contributions to the Method of Paired Comparisons.

Peer reviewed

Kaiser, Henry F.; Serlin, Ronald C. – Applied Psychological Measurement, 1978

A least-squares solution for the method of paired comparisons is given. The approach provokes a theorem regarding the amount of data necessary and sufficient for a solution to be obtained. A measure of the internal consistency of the least-squares fit is developed. (Author/CTM)

Descriptors: Higher Education, Least Squares Statistics, Mathematical Models, Measurement

Agreement between Raters

Peer reviewed

Th.van der Kamp, Leo J.; Mellenbergh, Gideon J. – Educational and Psychological Measurement, 1976

Joreskog's model of cogeneric tests is used to analyze agreement between raters. Raters are treated as measuring instruments. The model of cogeneric tests, of which classical parallelism and tau-equivalence are shown to be special cases, is applied to teachers' ratings of students' responses on open-end questions. (Author/RC)

Descriptors: Goodness of Fit, Mathematical Models, Rating Scales, Statistical Analysis

Efficiency of Linear Equating as a Function of the Length of the Anchor Test.

Peer reviewed

Budescu, David – Journal of Educational Measurement, 1985

An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)

Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Huynh, Huynh	3
Subkoviak, Michael J.	3
Bashaw, W. L.	2
Benson, Jeri	2
Brennan, Robert L.	2
Feldt, Leonard S.	2
Lord, Frederic M.	2
Rentz, R. Robert	2
Werts, C. E.	2
Algina, James	1
Besel, Ronald	1
Budescu, David	1
Byrne, Barbara M.	1
Cahan, Sorel	1
Cliff, Norman	1
Cohen, Allan S., Comp.	1
Downey, Ronald G.	1
Elias, Patricia J.	1
Gardner, P. L.	1
Gilmer, Jerry S.	1
Gleser, Leon Jay	1
Grossen, Neal E.	1
Haladyna, Tom	1
Harris, Chester W.	1
More ▼