Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 3 |
Descriptor
Test Items | 12 |
Sampling | 10 |
Item Response Theory | 6 |
Statistical Analysis | 4 |
Test Construction | 4 |
Item Sampling | 3 |
Reliability | 3 |
Bayesian Statistics | 2 |
Bias | 2 |
Computation | 2 |
Equated Scores | 2 |
Source
Applied Psychological Measurement | 12
Author
Babcock, Ben | 1 |
Baker, Frank B. | 1 |
Brennan, Robert L. | 1 |
Hanson, Bradley A. | 1 |
Huitzing, Hiddo A. | 1 |
Karabatsos, George | 1 |
Lee, Won-Chan | 1 |
Meijer, Rob R. | 1 |
Nevo, Baruch | 1 |
Sheu, Ching-Fan | 1 |
Van Onna, Marieke J. H. | 1 |
Publication Type
Journal Articles | 12 |
Reports - Evaluative | 7 |
Reports - Descriptive | 2 |
Reports - Research | 2 |
Reports - General | 1 |
Location
Israel | 1 |
Babcock, Ben – Applied Psychological Measurement, 2011
Relatively little research has been conducted with the noncompensatory class of multidimensional item response theory (MIRT) models. A Monte Carlo simulation study was conducted exploring the estimation of a two-parameter noncompensatory item response theory (IRT) model. The estimation method used was a Metropolis-Hastings within Gibbs algorithm…
Descriptors: Item Response Theory, Sampling, Computation, Statistical Analysis
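
A minimal sketch of what a Metropolis-Hastings-within-Gibbs update for a two-parameter noncompensatory (multiplicative) MIRT model can look like, assuming the usual product-of-logistics response function. The priors, tuning constants, and the simplification of holding item parameters at their true values are illustrative choices, not details of the study.

```python
# Sketch: MH-within-Gibbs for a two-parameter noncompensatory MIRT model,
# assuming P(X_ij = 1 | theta_i) = prod_d logistic(a_jd * (theta_id - b_jd)).
# Item-parameter updates are omitted for brevity; only theta is sampled.
import numpy as np

rng = np.random.default_rng(0)

def p_correct(theta, a, b):
    """Noncompensatory response probability: product over dimensions."""
    return np.prod(1.0 / (1.0 + np.exp(-a * (theta - b))))

def log_lik_person(x_i, theta_i, A, B):
    """Log-likelihood of person i's responses given item parameters."""
    p = np.array([p_correct(theta_i, A[j], B[j]) for j in range(len(x_i))])
    p = np.clip(p, 1e-10, 1 - 1e-10)
    return np.sum(x_i * np.log(p) + (1 - x_i) * np.log(1 - p))

def mh_step_theta(x_i, theta_i, A, B, step=0.3):
    """One Metropolis-Hastings update of a person's latent trait vector,
    with a standard normal prior on each dimension."""
    prop = theta_i + rng.normal(0, step, size=theta_i.shape)
    log_accept = (log_lik_person(x_i, prop, A, B) - 0.5 * prop @ prop) \
               - (log_lik_person(x_i, theta_i, A, B) - 0.5 * theta_i @ theta_i)
    return prop if np.log(rng.uniform()) < log_accept else theta_i

# Tiny synthetic run: N persons, J items, D dimensions (true values known).
N, J, D = 200, 10, 2
A_true = rng.uniform(0.8, 1.6, size=(J, D))
B_true = rng.normal(0, 1, size=(J, D))
theta_true = rng.normal(0, 1, size=(N, D))
X = np.array([[rng.uniform() < p_correct(theta_true[i], A_true[j], B_true[j])
               for j in range(J)] for i in range(N)], dtype=float)

theta = np.zeros((N, D))
for sweep in range(200):                 # Gibbs sweeps over persons
    for i in range(N):
        theta[i] = mh_step_theta(X[i], theta[i], A_true, B_true)
```
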
Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model
Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009
For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…
Descriptors: Classification, Reliability, Test Items, Scoring
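
For orientation, the single-administration classification indices in their generic form; this is a sketch, with the compound multinomial machinery of the article supplying the category probabilities for complex, multi-part assessments.

```latex
% P_c(\theta) is the probability that an examinee with proficiency \theta is
% classified into category c on one administration, c^*(\theta) is that
% examinee's true category, and f(\theta) is the proficiency density.
\[
  \text{accuracy:}\quad \gamma \;=\; \int P_{c^*(\theta)}(\theta)\, f(\theta)\, d\theta,
  \qquad
  \text{consistency:}\quad \phi \;=\; \int \sum_{c=1}^{C} \bigl[P_c(\theta)\bigr]^2 f(\theta)\, d\theta .
\]
```
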
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient alpha and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower-bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
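
A minimal sketch of the index the abstract refers to: coefficient alpha, which reduces to KR-20 for dichotomous items, computed from a persons-by-items matrix. The one-factor data-generating step is an illustrative toy, not part of the article.

```python
# Coefficient alpha (equivalently KR-20 for 0/1 items) from a score matrix;
# a lower bound to reliability under the conditions the abstract describes.
import numpy as np

def coefficient_alpha(X):
    """Cronbach's alpha; for dichotomous items this equals KR-20."""
    X = np.asarray(X, dtype=float)
    k = X.shape[1]                           # number of items
    item_vars = X.var(axis=0, ddof=1)        # per-item variances
    total_var = X.sum(axis=1).var(ddof=1)    # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

rng = np.random.default_rng(1)
ability = rng.normal(size=(500, 1))          # one common factor (toy data)
difficulty = rng.normal(size=(1, 20))
X = (rng.uniform(size=(500, 20)) < 1 / (1 + np.exp(-(ability - difficulty)))).astype(int)
print(round(coefficient_alpha(X), 3))
```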

van der Linden, Wim J. – Applied Psychological Measurement, 1979
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models
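
The binomial error model at issue, in its standard form; the article's contribution concerns the restrictions on item difficulty under which this form is defensible.

```latex
% n items are drawn from the domain, \zeta is the examinee's domain score
% (the proportion of domain items he or she would answer correctly), and X
% is the observed number-correct score.
\[
  P(X = x \mid \zeta) \;=\; \binom{n}{x}\, \zeta^{x} (1-\zeta)^{\,n-x},
  \qquad x = 0, 1, \dots, n .
\]
```

The model holds strictly when the items administered are, in effect, a random sample from the domain or are equally difficult for the examinee; departures from that condition are what the difficulty restrictions address.
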
Van Onna, Marieke J. H. – Applied Psychological Measurement, 2004
Coefficient "H" is used as an index of scalability in nonparametric item response theory (NIRT). It indicates the degree to which a set of items rank orders examinees. Theoretical sampling distributions, however, have only been derived asymptotically and only under restrictive conditions. Bootstrap methods offer an alternative possibility to…
Descriptors: Sampling, Item Response Theory, Scaling, Comparative Analysis
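
A minimal sketch of coefficient H for dichotomous items and a nonparametric bootstrap over examinees, assuming the covariance-ratio definition of H; the settings and names are illustrative, not Van Onna's design.

```python
# Loevinger's H and a bootstrap approximation to its sampling distribution.
import numpy as np

def coefficient_h(X):
    """Scalability coefficient H: sum of inter-item covariances divided by
    the sum of their maxima given the item marginals."""
    X = np.asarray(X, dtype=float)
    p = X.mean(axis=0)
    cov = np.cov(X, rowvar=False, ddof=0)
    num, den = 0.0, 0.0
    k = X.shape[1]
    for i in range(k):
        for j in range(i + 1, k):
            num += cov[i, j]
            den += min(p[i], p[j]) - p[i] * p[j]   # max covariance given marginals
    return num / den

def bootstrap_h(X, n_boot=1000, seed=0):
    """Resample examinees with replacement and recompute H each time."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    return np.array([coefficient_h(X[rng.integers(0, n, n)]) for _ in range(n_boot)])

# Usage: h_hat = coefficient_h(X); ci = np.percentile(bootstrap_h(X), [2.5, 97.5])
```
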

Nevo, Baruch – Applied Psychological Measurement, 1980
Computer simulations were employed to investigate sample sizes needed for item analysis when one's primary purpose is to rank order items in terms of indices of quality or to compare indices of quality against a fixed criterion. Necessary sample sizes were much smaller than those indicated by the literature. (Author/JKS)
Descriptors: Foreign Countries, Higher Education, Item Analysis, Sampling
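
A minimal sketch of this kind of simulation, assuming item difficulty (proportion correct) as the quality index and rank-order recovery as the criterion; the population size, sample sizes, and replication counts are illustrative.

```python
# How large must a sample be before sample item difficulties rank-order the
# items about as the population does?
import numpy as np

def spearman(a, b):
    """Rank correlation (no tie correction; adequate for a sketch)."""
    ra = np.argsort(np.argsort(a))
    rb = np.argsort(np.argsort(b))
    return np.corrcoef(ra, rb)[0, 1]

rng = np.random.default_rng(2)
J, N_POP = 40, 100_000
pop_p = rng.uniform(0.2, 0.9, size=J)                     # population item difficulties
population = (rng.uniform(size=(N_POP, J)) < pop_p).astype(int)

for n in (50, 100, 200, 400, 800):
    rhos = [spearman(population[rng.integers(0, N_POP, n)].mean(axis=0), pop_p)
            for _ in range(200)]                           # replications per sample size
    print(n, round(float(np.mean(rhos)), 3))
```
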

Baker, Frank B. – Applied Psychological Measurement, 1996
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions
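
The characteristic curve criterion in its usual Stocking-Lord form, shown for orientation; the article examines the sampling behavior of the equating coefficients A and B that minimize such a criterion.

```latex
% Item parameters on scale X are mapped by a^* = a/A and b^* = A b + B, and
% (A, B) minimize the squared difference between the two test characteristic
% curves over a set of ability points \theta_q.
\[
  F(A, B) \;=\; \sum_{q} \Bigl[ \sum_{j} P_j\bigl(\theta_q;\, a_j^{Y}, b_j^{Y}\bigr)
      \;-\; \sum_{j} P_j\bigl(\theta_q;\, a_j^{X}/A,\; A\, b_j^{X} + B\bigr) \Bigr]^2 .
\]
```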
Karabatsos, George; Sheu, Ching-Fan – Applied Psychological Measurement, 2004
This study introduces an order-constrained Bayes inference framework useful for analyzing data containing dichotomous scored item responses, under the assumptions of either the monotone homogeneity model or the double monotonicity model of nonparametric item response theory (NIRT). The framework involves the implementation of Gibbs sampling to…
Descriptors: Inferences, Nonparametric Statistics, Item Response Theory, Data Analysis
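
One common way to write the order constraints such a framework samples under, shown as a sketch rather than the authors' exact parameterization.

```latex
% Let \pi_{gj} = P(X_j = 1 \mid \text{ability group } g), with groups ordered
% g = 1 < 2 < \dots < G and items ordered by difficulty j = 1, \dots, J.
\[
  \text{monotone homogeneity:}\quad
  \pi_{1j} \le \pi_{2j} \le \dots \le \pi_{Gj} \quad \text{for every item } j,
\]
\[
  \text{double monotonicity (additionally):}\quad
  \pi_{g1} \ge \pi_{g2} \ge \dots \ge \pi_{gJ} \quad \text{for every group } g.
\]
% Gibbs sampling then draws each \pi_{gj} from its full conditional,
% truncated to the interval its neighboring constraints allow.
```
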

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993
The delta method was used to derive standard errors (SEs) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SEs derived without the normality assumption and bootstrap SEs were very close. The situation with skewed score distributions is also discussed. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling
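
The general shape of a delta-method standard error for a linear equating function, shown for orientation; Levine's observed score and true score methods differ only in how the slope and intercept are built from the anchor-test moments.

```latex
% Write the equated score as a smooth function g of the vector of estimated
% moments \hat{\mu}; \Sigma_{\hat{\mu}} is the asymptotic covariance matrix
% of those estimates. Dropping the normality assumption changes
% \Sigma_{\hat{\mu}} (fourth-order moments enter), not the approximation.
\[
  \hat{l}_Y(x) \;=\; g\bigl(x; \hat{\mu}\bigr),
  \qquad
  \operatorname{Var}\bigl[\hat{l}_Y(x)\bigr]
  \;\approx\;
  \nabla g(x;\mu)^{\top}\, \Sigma_{\hat{\mu}}\, \nabla g(x;\mu) .
\]
```
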
Huitzing, Hiddo A. – Applied Psychological Measurement, 2004
This article shows how set covering with item sampling (SCIS) methods can be used in the analysis and preanalysis of linear programming models for test assembly (LPTA). LPTA models can construct tests, fulfilling a set of constraints set by the test assembler. Sometimes, no solution to the LPTA model exists. The model is then said to be…
Descriptors: Mathematical Applications, Simulation, Item Sampling, Item Response Theory
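
A minimal sketch of the kind of 0-1 linear programming test assembly (LPTA) model whose infeasibility SCIS helps diagnose; the item data, constraint values, and use of the PuLP solver are illustrative assumptions, not the article's setup.

```python
# Toy LPTA model: pick items to maximize information at a target ability
# subject to a length constraint and a content floor.
from pulp import LpProblem, LpVariable, LpMaximize, lpSum, LpBinary, LpStatus

info    = [0.9, 0.4, 0.7, 0.3, 0.8, 0.6, 0.5, 0.2]   # item information at theta0
algebra = [1, 1, 0, 0, 1, 0, 1, 0]                    # content indicator
n_items = len(info)

prob = LpProblem("toy_test_assembly", LpMaximize)
x = [LpVariable(f"x{i}", cat=LpBinary) for i in range(n_items)]

prob += lpSum(info[i] * x[i] for i in range(n_items))          # objective
prob += lpSum(x) == 4                                          # test length
prob += lpSum(algebra[i] * x[i] for i in range(n_items)) >= 2  # content floor

prob.solve()
print(LpStatus[prob.status], [i for i in range(n_items) if x[i].value() > 0.5])
```

When no item mix can satisfy all constraints at once, the solver reports the model infeasible; set covering with item sampling is aimed at locating a small set of constraints (or items) responsible for that infeasibility.
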

Wilcox, Rand R. – Applied Psychological Measurement, 1980
This paper discusses how certain recent technical advances might be extended to examine proficiency tests which are conceptualized as representing a variety of skills with one or more items per skill. In contrast to previous analyses, errors at the item level are included. (Author/BW)
Descriptors: Mastery Tests, Minimum Competencies, Minimum Competency Testing, Sampling
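
One standard way to let item-level errors into a skills-based mastery model, shown as a generic sketch rather than the article's formulation.

```latex
% \zeta is the probability that the examinee has mastered the skill an item
% was sampled for, \beta is the chance that a master still misses the item,
% and \alpha is the chance that a nonmaster answers it correctly (e.g., by
% guessing).
\[
  P(\text{correct}) \;=\; \zeta\,(1 - \beta) \;+\; (1 - \zeta)\,\alpha .
\]
```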

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1995
Three methods based on the nonparametric item response theory (IRT) of R. J. Mokken for the estimation of the reliability of single dichotomous test items are discussed. Analytical and Monte Carlo studies show that one method, designated "MS," is superior because of smaller bias and smaller sampling variance. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Monte Carlo Methods, Nonparametric Statistics