Wilcox, Rand R. – Journal of Educational Statistics, 1981
Both the binomial and beta-binomial models are applied to various problems occurring in mental test theory. The paper reviews and critiques these models, with emphasis on extensions proposed in recent years that may not be familiar to many educators. (Author)
Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Test Reliability
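
The beta-binomial model treats each examinee's number-correct score as binomial with a beta-distributed true proportion-correct. As a rough illustration of the base model only (not Wilcox's extensions), here is a minimal method-of-moments fit in Python; the function name and the simulated data are my own:

    import numpy as np
    from scipy import stats

    def fit_beta_binomial(scores, n_items):
        """Moment estimates (alpha, beta) for X ~ BetaBinomial(n_items, alpha, beta)."""
        m, v = np.mean(scores), np.var(scores)
        p = m / n_items
        # Solve the beta-binomial variance identity for s = alpha + beta.
        s = (n_items**2 * p * (1 - p) - v) / (v - n_items * p * (1 - p))
        return s * p, s * (1 - p)

    rng = np.random.default_rng(0)
    n = 40                                  # items on the test
    true_p = rng.beta(8, 4, size=2000)      # examinees' true proportion-correct
    x = rng.binomial(n, true_p)             # observed number-correct scores
    a_hat, b_hat = fit_beta_binomial(x, n)
    print(a_hat, b_hat)                     # roughly recovers (8, 4)
    print(stats.betabinom(n, a_hat, b_hat).pmf(30))  # fitted score probability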

Woodruff, David – Journal of Educational Statistics, 1986
Linear equating methods for the common-item nonequivalent populations design are derived from explicitly stated congeneric-type test score models. The methods developed are compared with previously developed methods and applied to five professionally constructed examinations administered to approximately…
Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Scores
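
For readers unfamiliar with the design, a common linear method for it is Tucker equating. The sketch below implements the standard Tucker formulas under illustrative assumptions; it is not Woodruff's congeneric derivation, and all array names and data are mine:

    import numpy as np

    def tucker_linear(x, v1, y, v2, w1=0.5):
        """Equate form X to form Y via anchor scores V (v1: group 1, v2: group 2)."""
        w2 = 1.0 - w1
        g1 = np.cov(x, v1)[0, 1] / np.var(v1, ddof=1)   # slope of X on V
        g2 = np.cov(y, v2)[0, 1] / np.var(v2, ddof=1)   # slope of Y on V
        dmu = np.mean(v1) - np.mean(v2)
        dvar = np.var(v1, ddof=1) - np.var(v2, ddof=1)
        # Synthetic-population moments (standard Tucker formulas).
        mx = np.mean(x) - w2 * g1 * dmu
        my = np.mean(y) + w1 * g2 * dmu
        sx2 = np.var(x, ddof=1) - w2 * g1**2 * dvar + w1 * w2 * g1**2 * dmu**2
        sy2 = np.var(y, ddof=1) + w1 * g2**2 * dvar + w1 * w2 * g2**2 * dmu**2
        return lambda score: my + np.sqrt(sy2 / sx2) * (score - mx)

    rng = np.random.default_rng(0)
    t1 = rng.normal(0.2, 1, 3000)               # group 1 slightly more able
    t2 = rng.normal(0.0, 1, 3000)
    x = 30 + 5 * t1 + rng.normal(0, 2, 3000)    # form X scores, group 1
    v1 = 10 + 2 * t1 + rng.normal(0, 1, 3000)   # anchor scores, group 1
    y = 32 + 5 * t2 + rng.normal(0, 2, 3000)    # form Y scores, group 2
    v2 = 10 + 2 * t2 + rng.normal(0, 1, 3000)   # anchor scores, group 2
    print(tucker_linear(x, v1, y, v2)(35.0))    # form-Y equivalent of X = 35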

Holland, Paul W.; Thayer, Dorothy T. – Journal of Educational Statistics, 1985
Section pre-equating (SPE) equates a new test to an old one before the new test is used operationally, by making extensive use of experimental sections of the testing instrument. SPE theory is extended to allow for practice effects on both the old and new tests. (Author/BS)
Descriptors: Equated Scores, Mathematical Models, Statistical Studies, Test Construction

Jarjoura, David – Journal of Educational Statistics, 1985
Issues regarding tolerance and confidence intervals are discussed within the context of educational measurement, and conceptual distinctions are drawn between these two types of intervals. Points are raised about the advantages of tolerance intervals when the focus is on a particular observed score rather than a particular examinee. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Mathematical Models, Test Interpretation
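
The distinction can be made concrete numerically: a confidence interval brackets a population parameter such as the mean, while a tolerance interval aims to cover a stated proportion of individual scores. A minimal sketch, assuming normal scores and using Howe's common k-factor approximation; the data are simulated, not from the paper:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(1)
    x = rng.normal(500, 100, size=50)       # e.g., observed scale scores
    n, m, s = len(x), np.mean(x), np.std(x, ddof=1)

    # 95% confidence interval: a plausible range for the population MEAN.
    ci = stats.t.interval(0.95, df=n - 1, loc=m, scale=s / np.sqrt(n))

    # 95%/95% tolerance interval: a range meant to cover 95% of INDIVIDUAL
    # scores, with 95% confidence (Howe's k-factor approximation).
    z = stats.norm.ppf((1 + 0.95) / 2)
    k = z * np.sqrt((n - 1) * (1 + 1 / n) / stats.chi2.ppf(0.05, n - 1))
    ti = (m - k * s, m + k * s)
    print(ci, ti)                           # the tolerance interval is far wider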

Wainer, Howard; And Others – Journal of Educational Statistics, 1985
In this paper, scores from the Department of Education's table, "State Education Statistics," are examined to see if they can be used for state-by-state comparisons to aid in the evaluation of educational policies that vary across states. (Author/LMO)
Descriptors: Educational Assessment, Educational Indicators, Multivariate Analysis, National Norms

Westermann, Rainer; Hager, Willi – Journal of Educational Statistics, 1986
The well-known problem of cumulating error probabilities is reconsidered from a general epistemological perspective, namely, the concepts of severity and of fairness of tests. It is shown that not only Type 1 but also Type 2 errors can cumulate. A new adjustment strategy is proposed and applied. (Author/JAZ)
Descriptors: Educational Research, Error of Measurement, Hypothesis Testing, Measurement Techniques
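
The basic cumulation is easy to compute: over m independent tests at level alpha, the chance of at least one Type 1 error is 1 - (1 - alpha)^m. The sketch below shows this together with the textbook Bonferroni adjustment, which is not the authors' proposed strategy; it also illustrates their point that shrinking alpha per test trades Type 1 risk for Type 2 risk:

    alpha, m = 0.05, 10
    familywise = 1 - (1 - alpha) ** m       # P(at least one Type 1 error)
    print(f"familywise Type 1 error over {m} tests: {familywise:.3f}")  # ~0.401

    alpha_adj = alpha / m                   # Bonferroni: test each at alpha/m
    print(f"adjusted familywise bound: {1 - (1 - alpha_adj) ** m:.3f}")  # <~0.05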

Huynh, Huynh – Journal of Educational Statistics, 1986
Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)
Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas
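
For reference, the classical Spearman-Brown formula that the paper generalizes gives the reliability of a test lengthened by factor k as k*rho / (1 + (k - 1)*rho). A one-function sketch of this untruncated base case only; the truncation-adjusted version is the paper's contribution:

    def spearman_brown(rho, k):
        """Reliability of a test lengthened by factor k, given reliability rho."""
        return k * rho / (1 + (k - 1) * rho)

    print(spearman_brown(0.70, 2))   # doubling a 0.70-reliable test -> ~0.82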

Armstrong, Ronald D.; And Others – Journal of Educational Statistics, 1994
A network-flow model is formulated for constructing parallel tests based on classical test theory while using test reliability as the criterion. Practitioners can specify a test-difficulty distribution for values of item difficulties as well as test-composition requirements. An empirical study illustrates the reliability of generated tests. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks
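
To give a feel for the task, the toy sketch below casts form assembly as an assignment problem, matching bank items to target difficulty slots for two forms. This is only a simplified stand-in for the paper's network-flow model with a reliability criterion; all data and names are illustrative:

    import numpy as np
    from scipy.optimize import linear_sum_assignment

    rng = np.random.default_rng(2)
    bank = rng.uniform(0.2, 0.9, size=30)            # item difficulties (p-values)
    targets = np.tile(np.linspace(0.3, 0.8, 10), 2)  # 10 slots per form, 2 forms

    # Cost: distance of each bank item from each slot's target difficulty.
    cost = np.abs(bank[:, None] - targets[None, :])
    rows, cols = linear_sum_assignment(cost)         # min-cost matching
    form_a = np.sort(bank[rows[cols < 10]])
    form_b = np.sort(bank[rows[cols >= 10]])
    print(np.round(form_a, 2))                       # near-parallel difficulty
    print(np.round(form_b, 2))                       # profiles across forms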

Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980
Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)
Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics
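
A computational core of conditional maximum likelihood estimation for the Rasch model is the set of elementary symmetric functions gamma_r of the item parameters, which appear in the denominators of the conditional pattern probabilities. A minimal sketch of the standard summation recursion, numerically naive for very long tests and only one ingredient of a full CML routine:

    import numpy as np

    def elementary_symmetric(betas):
        """gamma[r], r = 0..n, of the epsilons exp(-beta_i)."""
        eps = np.exp(-np.asarray(betas, dtype=float))
        gamma = np.zeros(len(eps) + 1)
        gamma[0] = 1.0
        for e in eps:
            # Adding one item: gamma_new[r] = gamma[r] + e * gamma[r - 1].
            gamma[1:] = gamma[1:] + e * gamma[:-1]
        return gamma

    print(elementary_symmetric([0.0, 0.5, -0.5]))
    # gamma_3 equals the product of the epsilons, here 1.0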

Huynh, Huynh; Casteel, Jim – Journal of Educational Statistics, 1985
Two approaches, the minimax approach and the Rasch procedure, are described for the simultaneous determination of passing scores for subtests when the passing score for the total test is known. (Author/LMO)
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Latent Trait Theory
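
One way to picture the Rasch-based idea: find the ability theta at which the expected total score equals the total-test passing score, then read each subtest's expected score at that theta as its cut. The sketch below illustrates the concept only; it is not the authors' minimax or exact Rasch procedures, and the item parameters are invented:

    import numpy as np
    from scipy.optimize import brentq

    def p_rasch(theta, b):
        return 1.0 / (1.0 + np.exp(-(theta - b)))

    b_sub1 = np.array([-1.0, -0.5, 0.0, 0.4])   # subtest 1 item difficulties
    b_sub2 = np.array([-0.2, 0.3, 0.8, 1.2])    # subtest 2 item difficulties
    b_all = np.concatenate([b_sub1, b_sub2])
    total_cut = 5.0                             # passing score, 8-item total test

    # theta* where the total test characteristic curve equals the cut.
    theta_star = brentq(lambda t: p_rasch(t, b_all).sum() - total_cut, -6, 6)
    print(p_rasch(theta_star, b_sub1).sum())    # implied subtest 1 cut
    print(p_rasch(theta_star, b_sub2).sum())    # implied subtest 2 cut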

Jarjoura, David; Kolen, Michael J. – Journal of Educational Statistics, 1985
An equating design in which two groups of examinees from slightly different populations are each administered a different test form containing a subset of common items is widely used. This paper presents standard errors for an equipercentile equating procedure for this design, along with a simulation that verifies the large-sample equations. (Author/BS)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Estimation (Mathematics)
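
The core equipercentile transformation maps a form-X score to the form-Y score with the same percentile rank. The bare-bones sketch below does this for the simpler random-groups case with simulated data; the paper's contribution, the standard errors for the common-item design, is not reproduced here:

    import numpy as np

    rng = np.random.default_rng(3)
    x_scores = rng.binomial(50, 0.60, size=5000)  # form X number-correct
    y_scores = rng.binomial(50, 0.65, size=5000)  # form Y (slightly easier)

    def equipercentile(x, x_dist, y_dist):
        """Form-Y equivalent of score x via matched percentile ranks."""
        p = np.mean(x_dist <= x)                  # percentile rank of x on X
        return np.quantile(y_dist, p)             # Y score at that rank

    print(equipercentile(30, x_scores, y_scores)) # ~32-33 on the Y scale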

Jansen, Margo G. H. – Journal of Educational Statistics, 1986
In this paper a Bayesian procedure is developed for the simultaneous estimation of the reading-ability and difficulty parameters, which the multiplicative Poisson model treats as multiplicative factors in reading-error counts. By several criteria, the Bayesian estimates are better than comparable maximum likelihood estimates. (Author/JAZ)
Descriptors: Achievement Tests, Bayesian Statistics, Comparative Analysis, Difficulty Level
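
The multiplicative Poisson model takes error counts x_ij as Poisson with mean theta_i * delta_j. With gamma priors, both parameter sets have conjugate full conditionals, so a Gibbs sampler is easy to sketch; the priors, the rescaling step that pins down the unidentified theta/delta scale, and all settings below are my choices, not Jansen's procedure:

    import numpy as np

    rng = np.random.default_rng(4)
    n_pupils, n_texts = 100, 8
    theta_true = rng.gamma(5.0, 0.4, n_pupils)    # error-proneness (ability)
    delta_true = rng.gamma(4.0, 0.5, n_texts)     # text difficulty
    x = rng.poisson(np.outer(theta_true, delta_true))  # error counts

    a = b = c = d = 0.5                           # vague gamma hyperparameters
    theta, delta = np.ones(n_pupils), np.ones(n_texts)
    draws = []
    for it in range(2000):
        theta = rng.gamma(a + x.sum(axis=1), 1.0 / (b + delta.sum()))
        delta = rng.gamma(c + x.sum(axis=0), 1.0 / (d + theta.sum()))
        s = delta.mean()                          # pin the theta/delta scale
        delta, theta = delta / s, theta * s       # (their product is unchanged)
        if it >= 500:
            draws.append(theta.copy())
    print(np.mean(draws, axis=0)[:5])             # posterior means...
    print(theta_true[:5] * delta_true.mean())     # ...vs truth on the same scale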

Harrison, David A. – Journal of Educational Statistics, 1986
Multidimensional item response data were created. The strength of a general factor, the number of common factors, the distribution of items loading on common factors, and the number of items in simulated tests were manipulated. LOGIST effectively recovered both item and trait parameters in nearly all of the experimental conditions. (Author/JAZ)
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Correlation
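
The kind of data manipulated in the study can be simulated directly: each item loads on a general factor plus one of several group factors, and responses follow a logistic model. A sketch with made-up settings; LOGIST itself, a legacy unidimensional IRT program, is not reproduced here:

    import numpy as np

    rng = np.random.default_rng(5)
    n_examinees, n_items, n_group = 1000, 30, 3
    theta = rng.normal(size=(n_examinees, 1 + n_group))  # general + group traits

    a_general = np.full(n_items, 1.2)           # strong general factor
    a_group = np.zeros((n_items, n_group))
    a_group[np.arange(n_items), np.arange(n_items) % n_group] = 0.4
    loadings = np.column_stack([a_general, a_group])
    b = rng.normal(size=n_items)                # item difficulties

    logits = theta @ loadings.T - b             # examinee-by-item logits
    y = rng.random((n_examinees, n_items)) < 1 / (1 + np.exp(-logits))
    print(y.mean(axis=0)[:5])                   # simulated item p-values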