ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Statistical Analysis	11
Sampling	9
Reliability	5
Error of Measurement	4
Test Items	4
Analysis of Variance	3
Hypothesis Testing	3
Item Sampling	3
Research Design	3
Comparative Analysis	2
Bayesian Statistics	1
Bias	1
Computation	1
Creativity Tests	1
Difficulty Level	1
Equated Scores	1
Higher Education	1
Item Response Theory	1
Latent Trait Theory	1
Markov Processes	1
Mastery Tests	1
Mathematical Models	1
Matrices	1
Measurement Techniques	1
Minimum Competencies	1
More ▼

Source

Applied Psychological…

Author

Forsyth, Robert A.	2
Alsawalmeh, Yousef M.	1
Babcock, Ben	1
Feldt, Leonard S.	1
Frederiksen, Norman	1
Levin, Joel R.	1
MacCallum, Robert C.	1
Manalo, Jonathan R.	1
Rijmen, Frank	1
Subkoviak, Michael J.	1
Waller, Niels G.	1
Ward, William C.	1
Wilcox, Rand R.	1
van der Linden, Wim J.	1
von Davier, Alina A.	1
More ▼

Publication Type

Journal Articles	7
Reports - Evaluative	4
Reports - Research	2
Reports - General	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Estimating a Noncompensatory IRT Model Using Metropolis within Gibbs Sampling

Peer reviewed

Direct link

Babcock, Ben – Applied Psychological Measurement, 2011

Relatively little research has been conducted with the noncompensatory class of multidimensional item response theory (MIRT) models. A Monte Carlo simulation study was conducted exploring the estimation of a two-parameter noncompensatory item response theory (IRT) model. The estimation method used was a Metropolis-Hastings within Gibbs algorithm…

Descriptors: Item Response Theory, Sampling, Computation, Statistical Analysis

Asymptotic and Sampling-Based Standard Errors for Two Population Invariance Measures in the Linear Equating Case

Peer reviewed

Direct link

Rijmen, Frank; Manalo, Jonathan R.; von Davier, Alina A. – Applied Psychological Measurement, 2009

This article describes two methods for obtaining the standard errors of two commonly used population invariance measures of equating functions: the root mean square difference of the subpopulation equating functions from the overall equating function and the root expected mean square difference. The delta method relies on an analytical…

Descriptors: Error of Measurement, Sampling, Equated Scores, Statistical Analysis

Commingled Samples: A Neglected Source of Bias in Reliability Analysis

Peer reviewed

Direct link

Waller, Niels G. – Applied Psychological Measurement, 2008

Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…

Descriptors: Test Items, Reliability, Scores, Psychometrics

Binomial Test Models and Item Difficulty.

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 1979

The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)

Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models

Testing the Equality of Two Related Intraclass Reliability Coefficients.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1994

An approximate statistical test of the equality of two intraclass reliability coefficients based on the same sample of people is derived. Such a test is needed when a researcher wishes to compare the reliability of two measurement procedures, and both procedures can be applied to results from the same group. (SLD)

Descriptors: Comparative Analysis, Measurement Techniques, Reliability, Sampling

A Note on "Planning an Experiment in the Company of Measurement Error" by Levin and Subkoviak.

Peer reviewed

Forsyth, Robert A. – Applied Psychological Measurement, 1978

This note shows that, under conditions specified by Levin and Subkoviak (TM 503 420), it is not necessary to specify the reliabilities of observed scores when comparing completely randomized designs with randomized block designs. Certain errors in their illustrative example are also discussed. (Author/CTM)

Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Correcting "Planning an Experiment in the Company of Measurement Error."

Peer reviewed

Levin, Joel R.; Subkoviak, Michael J. – Applied Psychological Measurement, 1978

Comments (TM 503 706) on an earlier article (TM 503 420) concerning the comparison of the completely randomized design and the randomized block design are acknowledged and appreciated. In addition, potentially misleading notions arising from these comments are addressed and clarified. (See also TM 503 708). (Author/CTM)

Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Some Additional Comments on "Planning an Experiment in the Company of Measurement Error."

Peer reviewed

Forsyth, Robert A. – Applied Psychological Measurement, 1978

This note continues the discussion of earlier articles (TM 503 420, TM 503 706, and TM 503 707), comparing the completely randomized design with the randomized block design. (CTM)

Descriptors: Analysis of Variance, Error of Measurement, Hypothesis Testing, Reliability

Validity and Cross-Validity of Metric and Nonmetric Multiple Regression.

Peer reviewed

MacCallum, Robert C.; And Others – Applied Psychological Measurement, 1979

Questions are raised concerning differences between traditional metric multiple regression, which assumes all variables to be measured on interval scales, and nonmetric multiple regression. The ordinal model is generally superior in fitting derivation samples but the metric technique fits better than the nonmetric in cross-validation samples.…

Descriptors: Comparative Analysis, Multiple Regression Analysis, Nonparametric Statistics, Personnel Evaluation

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Peer reviewed

Wilcox, Rand R. – Applied Psychological Measurement, 1980

This paper discusses how certain recent technical advances might be extended to examine proficiency tests which are conceptualized as representing a variety of skills with one or more items per skill. In contrast to previous analyses, errors in the item level are included. (Author/BW)

Descriptors: Mastery Tests, Minimum Competencies, Minimum Competency Testing, Sampling

Measures for the Study of Creativity in Scientific Problem-Solving

Peer reviewed

Frederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978

A set of Tests of Scientific Thinking were developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)

Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity