ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	5

Descriptor

Test Items	11
Test Theory	11
Item Response Theory	3
Comparative Analysis	2
Computation	2
Correlation	2
Elementary School Students	2
Equations (Mathematics)	2
Error of Measurement	2
Foreign Countries	2
Measures (Individuals)	2
Models	2
Reliability	2
Responses	2
Simulation	2
Test Construction	2
Tests	2
Adaptive Testing	1
Adults	1
Algorithms	1
Anxiety	1
Attitudes	1
Bayesian Statistics	1
Bias	1
Cheating	1
More ▼

Source

Applied Psychological…

Author

van der Linden, Wim J.	2
Almehrizi, Rashid S.	1
Andrich, David	1
Armstrong, Ronald D.	1
Borsboom, Denny	1
Budgell, Glen R.	1
Jones, Douglas H.	1
Kreiner, Svend	1
Mellenbergh, Gideon J.	1
Penfield, Randall D.	1
Raykov, Tenko	1
Sinar, Evan F.	1
Van Heerden, Jaap	1
Wollack, James A.	1
Zickar, Michael J.	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	6
Reports - Evaluative	4
Reports - Descriptive	1

Education Level

Elementary Education	2
Early Childhood Education	1
Grade 2	1
Higher Education	1
Postsecondary Education	1
Primary Education	1

Audience

Location

Australia	1
Canada	1

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Explaining Crossing DIF in Polytomous Items Using Differential Step Functioning Effects

Peer reviewed

Direct link

Penfield, Randall D. – Applied Psychological Measurement, 2010

Crossing, or intersecting, differential item functioning (DIF) is a form of nonuniform DIF that exists when the sign of the between-group difference in expected item performance changes across the latent trait continuum. The presence of crossing DIF presents a problem for many statistics developed for evaluating DIF because positive and negative…

Descriptors: Test Bias, Test Items, Statistics, Test Theory

Coefficient Alpha and Reliability of Scale Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Psychological Measurement, 2013

The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…

Descriptors: Raw Scores, Scaling, Reliability, Computation

Quantifying Response Dependence between Two Dichotomous Items Using the Rasch Model

Peer reviewed

Direct link

Andrich, David; Kreiner, Svend – Applied Psychological Measurement, 2010

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…

Descriptors: Test Theory, Item Response Theory, Test Items, Correlation

Predictive Control of Speededness in Adaptive Testing

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2009

An adaptive testing method is presented that controls the speededness of a test using predictions of the test takers' response times on the candidate items in the pool. Two different types of predictions are investigated: posterior predictions given the actual response times on the items already administered and posterior predictions that use the…

Descriptors: Simulation, Adaptive Testing, Vocational Aptitude, Bayesian Statistics

Different Kinds of DIF: A Distinction between Absolute and Relative Forms of Measurement Invariance and Bias

Peer reviewed

Direct link

Borsboom, Denny; Mellenbergh, Gideon J.; Van Heerden, Jaap – Applied Psychological Measurement, 2002

In this article, a distinction is made between absolute and relative measurement. Absolute measurement refers to the measurement of traits on a group-invariant scale, and relative measurement refers to the within-group measurement of traits, where the scale of measurement is expressed in terms of the within-group position on a trait. Relative…

Descriptors: Test Items, Measures (Individuals), Test Theory

Coefficient Alpha and Composite Reliability with Interrelated Nonhomogeneous Items.

Peer reviewed

Raykov, Tenko – Applied Psychological Measurement, 1998

Examines the relationship between Cronbach's coefficient alpha and the reliability of a composite of a prespecified set of interrelated nonhomogeneous components through simulation. Shows that alpha can over- or underestimate scale reliability at the population level. Illustrates the bias in terms of structural parameters. (SLD)

Descriptors: Reliability, Simulation, Statistical Bias, Structural Equation Models

A Nominal Response Model Approach for Detecting Answer Copying.

Peer reviewed

Wollack, James A. – Applied Psychological Measurement, 1997

Introduces a new Item Response Theory (IRT) based statistic for detecting answer copying. Compares this omega statistic with the best classical test theory-based statistic under various conditions, and finds omega superior based on Type I error rate and power. (SLD)

Descriptors: Cheating, Identification, Item Response Theory, Power (Statistics)

Evaluating the Robustness of Graded Response Model and Classical Test Theory Parameter Estimates to Deviant Items.

Peer reviewed

Sinar, Evan F.; Zickar, Michael J. – Applied Psychological Measurement, 2002

Examined the influence of deviant scale items on item parameter estimates of focal scale items and person parameter estimates through a comparison of item response theory (IRT) and classical test theory (CTT) models. Used Monte Carlo methods to explore results from a pilot investigation of job attitude data. Discusses implications for researchers…

Descriptors: Attitudes, Estimation (Mathematics), Monte Carlo Methods, Robustness (Statistics)

Equating Error in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Traditionally, error in equating observed scores on two versions of a test is defined as the difference between the transformations that equate the quantiles of their distributions in the sample and population of test takers. But it is argued that if the goal of equating is to adjust the scores of test takers on one version of the test to make…

Descriptors: Equated Scores, Evaluation Criteria, Models, Error of Measurement

Polynomial Algorithms for Item Matching.

Peer reviewed

Armstrong, Ronald D.; Jones, Douglas H. – Applied Psychological Measurement, 1992

Polynomial algorithms are presented that are used to solve selected problems in test theory, and computational results from sample problems with several hundred decision variables are provided that demonstrate the benefits of these algorithms. The algorithms are based on optimization theory in networks (graphs). (SLD)

Descriptors: Algorithms, Decision Making, Equations (Mathematics), Mathematical Models

Analysis of Differential Item Functioning in Translated Assessment Instruments.

Peer reviewed

Budgell, Glen R.; And Others – Applied Psychological Measurement, 1995

The usefulness of three item response theory-based methods and the Mantel Haenszel technique in evaluating the measurement equivalence of translated assessment instruments was demonstrated in a study involving 2,000 French-speaking Canadian adults who took a French test translation and 2,000 English-speaking adults who took the English original.…

Descriptors: Adults, Chi Square, Cultural Awareness, Culture Fair Tests