ERIC - Search Results

Showing all 13 results

Determining Item Screening Criteria Using Cost-Benefit Analysis

Peer reviewed

Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019

Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…

Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy

Estimating Means via Multiple Matrix Sampling: A Note on the Effects of Selected Data Base Characteristics

Peer reviewed

Forsyth, Robert A. – Educational and Psychological Measurement, 1976

Shoemaker's conclusions related to the influence of various data base characteristics (reliability, variability of item difficulty indices, and degree of skewness in the normative distribution) on the standard error of a mean estimated via multiple matrix sampling procedures are examined. (Author/RC)

Descriptors: Item Sampling, Statistical Analysis, Test Reliability

A Comparison of Interval Estimation of Coefficient Alpha Using the Feldt and the Jackknife Procedures.

Pandey, Tej N.; Hubert, Lawrence J. – 1974

This investigation had two major purposes. The first was to explore the use of an inferential technique called Tukey's Jackknife in establishing a confidence interval about coefficient alpha reliability. The second purpose was to study the robustness of the Feldt and the jackknife procedures when the data fails to satisfy usual normality…

Descriptors: Comparative Analysis, Item Sampling, Statistical Analysis, Statistics

Confidence Interval Estimation of KR sub 20--Some Monte Carlo Results.

Mandeville, Garrett K. – 1973

Extensive Monte Carlo results are presented that indicate the conditions under which a procedure using the F distribution can be used to study the robustness of the confidence interval procedures for small samples. The literature is reviewed. The procedure uses a binary data matrix. Results indicate that…

Descriptors: Confidence Testing, Item Sampling, Literature Reviews, Monte Carlo Methods

Standard Errors of Estimate in Item-Examinee Sampling as a Function of Test Reliability, Variation in Item Difficulty Indices and Degree of Skewness in the Normative Distribution

Peer reviewed

Shoemaker, David M. – Educational and Psychological Measurement, 1972

Descriptors: Difficulty Level, Error of Measurement, Item Sampling, Simulation

Techniques for Analyzing Test Response Data.

Harris, Chester W. – 1975

Achievement tests which are specifically linked to an instructional program and have been developed in relation to an objectives base and/or to an item generation rule are considered, as well as student response data. Three types of studies are outlined and the kind of procedures thought useful illustrated. As various methods for examining…

Descriptors: Achievement Tests, Instructional Programs, Item Banks, Item Sampling

A Fortran IV Program for Estimating Parameters through Multiple Matrix Sampling with Standard Errors of Estimate Approximated by the Jackknife.

Shoemaker, David M. – 1972

Described and listed here, with accompanying sample input and output, is the Fortran IV program that estimates parameters through multiple matrix sampling, together with a standard error of estimate for each parameter. The program is an improved and expanded version of an earlier one. (Author/BJG)

Descriptors: Computer Oriented Programs, Computer Programs, Error of Measurement, Error Patterns

Criterion-Referenced Test Interpretations of "Classical" Measurement Theory.

Epstein, Kenneth I.; Knerr, Claramae S. – 1976

The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…

Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling

Measures for the Study of Creativity in Scientific Problem-Solving

Peer reviewed

Frederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978

A set of Tests of Scientific Thinking was developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)

Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity

Estimating a Correlation Coefficient Using a Multiple Matrix Sampling Design.

Estes, Carole; Estes, Gary D. – 1980

Multiple matrix sampling is a sampling design in which both test items and examinees are randomly sampled from their respective populations. This study was designed to develop and assess a method for computing an estimate of a correlation coefficient when a multiple matrix sampling design is used. The examinee populations included 212 third-grade…

Descriptors: Correlation, Elementary Secondary Education, Evaluation Methods, Grade 3

The Generalizability of District Means Using Multiple Matrix Sampling.

Pandey, Tej N. – 1978

The concept under investigation was the reliability of estimates of mean scores of groups under various assumptions of multiple-matrix sampling when reliabilities are computed according to procedures based on generalizability theory. Four different cases were compared with respect to the generalizability coefficients depending upon whether pupils…

Descriptors: Achievement Tests, Analysis of Variance, Basic Skills, Elementary Secondary Education

Achievement Test Items--Methods of Study. CSE Monograph Series in Evaluation, 6.

Harris, Chester W.; And Others – 1977

The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…

Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies

An Analysis of Two Procedures for Decisionmaking When Using Domain-Referenced Tests.

Haladyna, Thomas – 1975

A central problem for the user of domain-referenced tests in instruction is deciding who has passed and who has failed. Two procedures were presented and discussed. The first, employing classical test theory, was found to be more useful for larger domains and where the passing standard is 70 percent or less. The sampling procedure suggested by…

Descriptors: Academic Achievement, Academic Standards, Criterion Referenced Tests, Decision Making Skills