ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	2

Descriptor

Probability	11
Test Reliability	11
Rating Scales	3
Statistical Analysis	3
Statistical Significance	3
Test Validity	3
Error of Measurement	2
Factor Analysis	2
Hypothesis Testing	2
Item Analysis	2
Measurement	2
Sample Size	2
Scores	2
Scoring Formulas	2
Test Interpretation	2
Test Items	2
Test Theory	2
Achievement	1
Achievement Tests	1
Career Development	1
Computer Programs	1
Computer Simulation	1
Computer Software	1
Correlation	1
Cutting Scores	1
More ▼

Source

Educational and Psychological…

Author

Zimmerman, Donald W.	3
Aiken, Lewis R.	2
Dyer, Frank J.	1
Ellis, Jules L.	1
Hofmann, Richard J.	1
Muller, Jorg M.	1
Weber, Margaret B.	1
Wilcox, Rand R.	1

Publication Type

Journal Articles	8
Reports - Research	8
Numerical/Quantitative Data	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

A Simple Model to Determine the Efficient Duration of Exams

Peer reviewed

Direct link

Ellis, Jules L. – Educational and Psychological Measurement, 2021

This study develops a theoretical model for the costs of an exam as a function of its duration. Two kind of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in time of the student. Based on a classical test theory model, enriched with assumptions on the context, the costs…

Descriptors: Test Length, Models, Error of Measurement, Measurement

The Probability of Obtaining Two Statistically Different Test Scores as a Test Index

Peer reviewed

Direct link

Muller, Jorg M. – Educational and Psychological Measurement, 2006

A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…

Descriptors: Test Reliability, Probability, Scores, Item Response Theory

Prediction Analysis and the Reliability of a Mastery Test.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

The classical estimate of a binomial probability function is to estimate its mean in the usual manner and to substitute the results in the appropriate expression. Two alternative estimation procedures are described and examined. Emphasis is given to the single administration estimate of the mastery test reliability. (Author/CTM)

Descriptors: Cutting Scores, Mastery Tests, Probability, Scores

Test Reliability and the Kuder-Richardson Formulas: Derivation from Probability Theory

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1972

Although a great deal of attention has been devoted over a period of years to the estimation of reliability from item statistics, there are still gaps in the mathematical derivation of the Kuder-Richardson results. The main purpose of this paper is to fill some of these gaps, using language consistent with modern probability theory. (Author)

Descriptors: Mathematical Applications, Probability, Scoring Formulas, Statistical Analysis

Application of Power Analysis Concepts to Test Reliability Research.

Peer reviewed

Dyer, Frank J. – Educational and Psychological Measurement, 1980

Power analysis is in essence a technique for estimating the probability of obtaining a specific minimum observed effect size. Power analysis techniques are applied to research planning problems in test reliability studies. A table for use in research planning and hypothesis testing is presented. (Author)

Descriptors: Hypothesis Testing, Mathematical Formulas, Power (Statistics), Probability

Test Theory with Minimal Assumptions

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1976

Using the concepts of conditional probability, conditional expectation, and conditional independence, the main results of the classical test theory model can be derived in a very few steps with minimal assumptions. The present effort explores the possibility that present classical test theories can be further condensed. (Author/RC)

Descriptors: Career Development, Correlation, Mathematical Models, Measurement

Variability of Deviation IQ's Based on Multiple-Choice Test Scores.

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1985

A computer program simulated guessing on multiple-choice test items and calculated deviation IQ's from observed scores which contained a guessing component. Extensive variability in deviation IQ's due entirely to chance was found. (Author/LMO)

Descriptors: Computer Simulation, Error of Measurement, Guessing (Tests), Intelligence Quotient

An Examination of the Bilevel Dimensionality of Lower and Higher Mental Processes in Probability Achievement

Peer reviewed

Weber, Margaret B. – Educational and Psychological Measurement, 1977

Bilevel dimensionality of probability was examined via factor analysis, Rasch latent trait analysis, and classical item analysis. Results suggest that when nonstandardized measures are the criteria for achievement, relying solely on estimates of content validity may lead to erroneous interpretation of test score data. (JKS)

Descriptors: Achievement, Achievement Tests, Factor Analysis, Item Analysis

On Testing a Guttman Scale for Significance.

Peer reviewed

Hofmann, Richard J. – Educational and Psychological Measurement, 1979

The Guttman scale is discussed from the viewpoint of errors in response patterns. The errors are assumed to be distributed as a binomial. A double-barreled significance test is suggested having two probabilities: high probability and low probability. (Author)

Descriptors: Error Patterns, Hypothesis Testing, Probability, Psychometrics

Three Coefficients for Analyzing the Reliability and Validity of Ratings.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1985

Three numerical coefficients for analyzing the validity and reliability of ratings are described. Each coefficient is computed as the ratio of an obtained to a maximum sum of differences in ratings. The coefficients are also applicable to the item analysis, agreement analysis, and cluster or factor analysis of rating-scale data. (Author/BW)

Descriptors: Computer Software, Data Analysis, Factor Analysis, Item Analysis

Content Validity and Reliability of Single Items or Questionnaires.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1980

Procedures for computing content validity and consistency reliability coefficients and determining the statistical significance of these coefficients are described. Procedures employing the multinomial probability distribution for small samples and normal curve probability estimates for large samples, can be used where judgments are made on…

Descriptors: Computer Programs, Measurement Techniques, Probability, Questionnaires