Showing 1 to 15 of 16 results
Peer reviewed
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
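As background to the entry above: the classical-test-theory formula for the reliability of a difference score shows why difference scores are often much less reliable than the pretest and posttest themselves. A minimal sketch, assuming the standard classical formula (the function name is illustrative, not from the paper):

```python
import numpy as np

def difference_score_reliability(var_pre, var_post, rel_pre, rel_post, r_prepost):
    """Classical-test-theory reliability of the difference score
    D = posttest - pretest, given observed variances, reliabilities,
    and the pretest-posttest correlation."""
    cov = r_prepost * np.sqrt(var_pre * var_post)           # cov(pre, post)
    true_var = rel_pre * var_pre + rel_post * var_post - 2 * cov
    obs_var = var_pre + var_post - 2 * cov
    return true_var / obs_var

# Two tests with reliability .80 that correlate .70: the difference
# score's reliability drops to about .33.
print(difference_score_reliability(25.0, 25.0, 0.80, 0.80, 0.70))
```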
Peer reviewed
Mittelhaëuser, Marie-Anne; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational Measurement, 2015
The purpose of this study was to investigate whether simulated differential motivation between the stakes for operational tests and anchor items produces an invalid linking result if the Rasch model is used to link the operational tests. This was done for an external anchor design and a variation of a pretest design. The study also investigated…
Descriptors: Item Response Theory, Simulation, High Stakes Tests, Pretesting
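For readers unfamiliar with Rasch linking: a common approach places two forms on the same scale by shifting new-form difficulties by the mean difference on the anchor items. A minimal sketch of this mean-mean link and of how a motivation effect on the anchors biases it (illustrative values only, not the study's design):

```python
import numpy as np

def mean_mean_link(b_anchor_ref, b_anchor_new):
    """Mean-mean Rasch linking: the constant to add to new-form item
    difficulties to place them on the reference scale, estimated from
    the common anchor items."""
    return float(np.mean(np.asarray(b_anchor_ref) - np.asarray(b_anchor_new)))

b_ref = np.array([-1.0, -0.2, 0.4, 1.1])   # anchor difficulties, reference scale
b_new = b_ref - 0.5                        # same anchors on the new form's scale
print(mean_mean_link(b_ref, b_new))        # 0.5: the true scale shift is recovered

# If low stakes make the anchors effectively harder on the new form,
# the motivation effect is absorbed into the linking constant:
print(mean_mean_link(b_ref, b_new + 0.3))  # 0.2: a biased link
```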
Peer reviewed
van der Palm, Daniël W.; van der Ark, L. Andries; Sijtsma, Klaas – Journal of Educational Measurement, 2014
The latent class reliability coefficient (LCRC) is improved by using the divisive latent class model instead of the unrestricted latent class model. This results in the divisive latent class reliability coefficient (DLCRC), which unlike LCRC avoids making subjective decisions about the best solution and thus avoids judgment error. A computational…
Descriptors: Test Reliability, Scores, Computation, Simulation
Peer reviewed
Straat, J. Hendrik; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2014
An automated item selection procedure in Mokken scale analysis partitions a set of items into one or more Mokken scales, if the data allow. Two algorithms are available that pursue the same goal of selecting Mokken scales of maximum length: Mokken's original automated item selection procedure (AISP) and a genetic algorithm (GA). Minimum…
Descriptors: Sampling, Test Items, Effect Size, Scaling
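Mokken scales are built around Loevinger's scalability coefficients; AISP admits items whose scalability exceeds a user-chosen lower bound (commonly .3). A minimal sketch of the pairwise coefficient H_ij for dichotomous items, assuming the usual definition as observed covariance over its maximum given the marginals (function name illustrative):

```python
import numpy as np

def loevinger_h_ij(x_i, x_j):
    """Pairwise scalability coefficient H_ij for two dichotomous item-score
    vectors: observed covariance over its maximum given the marginals."""
    x_i, x_j = np.asarray(x_i, float), np.asarray(x_j, float)
    if x_i.mean() > x_j.mean():            # order so x_i is the harder item
        x_i, x_j = x_j, x_i
    p_i, p_j = x_i.mean(), x_j.mean()      # p_i <= p_j after ordering
    cov = (x_i * x_j).mean() - p_i * p_j
    return cov / (p_i * (1 - p_j))         # max cov given marginals: p_i(1 - p_j)

# Two Rasch-type items driven by a common trait give a positive H_ij.
rng = np.random.default_rng(3)
theta = rng.normal(size=2000)
x1 = (rng.random(2000) < 1 / (1 + np.exp(-(theta - 0.5)))).astype(int)
x2 = (rng.random(2000) < 1 / (1 + np.exp(-(theta + 0.5)))).astype(int)
print(loevinger_h_ij(x1, x2))
```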
Peer reviewed
Tijmstra, Jesper; Hessen, David J.; van der Heijden, Peter G. M.; Sijtsma, Klaas – Psychometrika, 2013
Most dichotomous item response models share the assumption of latent monotonicity, which states that the probability of a positive response to an item is a nondecreasing function of a latent variable intended to be measured. Latent monotonicity cannot be evaluated directly, but it implies manifest monotonicity across a variety of observed scores,…
Descriptors: Item Response Theory, Statistical Inference, Probability, Psychometrics
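The manifest-monotonicity notion referred to above can be illustrated on the rest score: P(X_j = 1 | rest score) should be nondecreasing. A simplified sketch (practical implementations pool sparse rest-score groups before testing; names here are illustrative):

```python
import numpy as np

def rest_score_regression(scores, item):
    """Estimate P(X_item = 1 | rest score) from an n x k matrix of 0/1
    scores; manifest monotonicity requires the estimates to be
    nondecreasing in the rest score."""
    scores = np.asarray(scores)
    rest = scores.sum(axis=1) - scores[:, item]       # rest score R_j
    levels = np.unique(rest)
    props = np.array([scores[rest == r, item].mean() for r in levels])
    return levels, props

# Rasch-type data are monotone by construction, so apart from sampling
# error the estimated proportions should not decrease.
rng = np.random.default_rng(7)
theta = rng.normal(size=1000)
b = np.linspace(-1.5, 1.5, 10)
p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))
x = (rng.random((1000, 10)) < p).astype(int)
print(rest_score_regression(x, item=0)[1])
```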
Peer reviewed
van der Ark, L. Andries; van der Palm, Daniël W.; Sijtsma, Klaas – Applied Psychological Measurement, 2011
This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
Descriptors: Simulation, Reliability, Measurement, Psychology
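Two of the single-administration methods named above have closed forms that fit in a few lines. A minimal sketch of Cronbach's alpha and Guttman's lambda-2 from an item covariance matrix (the latent class reliability approach itself requires latent class estimation and is not shown):

```python
import numpy as np

def alpha_lambda2(scores):
    """Cronbach's alpha and Guttman's lambda-2 from an n-persons x k-items
    score matrix (single test administration)."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]
    S = np.cov(scores, rowvar=False)       # item covariance matrix
    total_var = S.sum()                    # variance of the total score
    item_var = np.trace(S)
    alpha = k / (k - 1) * (1 - item_var / total_var)
    c2 = (S ** 2).sum() - (np.diag(S) ** 2).sum()  # sum of squared off-diagonal covariances
    lam2 = (total_var - item_var + np.sqrt(k / (k - 1) * c2)) / total_var
    return alpha, lam2

# Eight dichotomous items driven by a common factor; lambda-2 >= alpha.
rng = np.random.default_rng(1)
theta = rng.normal(size=(500, 1))
items = (theta + rng.normal(size=(500, 8)) > 0).astype(int)
print(alpha_lambda2(items))
```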
Peer reviewed
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length degrades decision quality due to increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
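The vulnerability of short tests noted above can be quantified with the Spearman-Brown prophecy formula, which projects reliability under a change of test length. A minimal sketch (standard formula; not the paper's decision-quality analysis):

```python
def spearman_brown(reliability, length_ratio):
    """Projected reliability when test length is multiplied by
    length_ratio (e.g., 0.25 = a quarter-length test)."""
    return length_ratio * reliability / (1 + (length_ratio - 1) * reliability)

# A 40-item test with reliability .90 shortened to 10 items:
print(spearman_brown(0.90, 0.25))   # about .69
```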
Peer reviewed
Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas – Multivariate Behavioral Research, 2011
The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…
Descriptors: Monte Carlo Methods, Patients, Probability, Item Response Theory
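A simplified, non-multilevel version of the PRF slope idea: regress each examinee's item scores on the item difficulties and inspect the slope. A sketch assuming scikit-learn is available (Reise's proposal estimates these slopes in a multilevel model instead):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def prf_slopes(scores, b):
    """Per-person logistic regression of 0/1 item scores on item
    difficulties b. For a fitting respondent the slope is negative
    (success gets less likely as items get harder); slopes near zero
    or positive suggest person misfit."""
    scores = np.asarray(scores)
    b = np.asarray(b).reshape(-1, 1)
    slopes = np.full(len(scores), np.nan)
    for v, row in enumerate(scores):
        if row.min() == row.max():          # all-0 or all-1 pattern: undefined
            continue
        fit = LogisticRegression(C=1e6).fit(b, row)   # large C: ~no penalty
        slopes[v] = fit.coef_[0, 0]
    return slopes
```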
Peer reviewed
van der Ark, L. Andries; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational Measurement, 2008
Two types of answer-copying statistics for detecting copiers in small-scale examinations are proposed. One statistic identifies the "copier-source" pair, and the other in addition suggests who is copier and who is source. Both types of statistics can be used when the examination has alternate test forms. A simulation study shows that the…
Descriptors: Cheating, Statistics, Test Format, Measures (Individuals)
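The statistics proposed in the paper are not reproduced here, but the intuition behind answer-copying detection can be illustrated with a naive index: the count of identical incorrect answers per examinee pair. A minimal sketch (illustrative only; the paper's pair and copier-source statistics are more refined):

```python
import numpy as np
from itertools import combinations

def identical_wrong_answers(responses, key):
    """For every examinee pair, count items on which both chose the same
    wrong option; unusually high counts warrant a closer look."""
    R = np.asarray(responses)
    wrong = R != np.asarray(key)[None, :]
    return {(a, b): int(((R[a] == R[b]) & wrong[a] & wrong[b]).sum())
            for a, b in combinations(range(R.shape[0]), 2)}
```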
Peer reviewed
van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas – Multivariate Behavioral Research, 2007
The performance of five simple multiple imputation methods for dealing with missing data was compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmarks, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…
Descriptors: Evaluation Methods, Psychometrics, Item Response Theory, Scores
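One simple method prominent in this literature is two-way imputation, which fills a missing entry with person mean + item mean - overall mean; the truncated abstract does not show whether it is among the five compared. A minimal sketch of its deterministic core (multiple imputation adds random error draws to each imputed value):

```python
import numpy as np

def two_way_imputation(scores):
    """Deterministic two-way imputation for an n x k score matrix with
    np.nan marking missing entries: person mean + item mean - overall
    mean. In practice, round or clip results to the admissible range."""
    X = np.array(scores, dtype=float)
    pm = np.nanmean(X, axis=1, keepdims=True)   # person means
    im = np.nanmean(X, axis=0, keepdims=True)   # item means
    om = np.nanmean(X)                          # overall mean
    return np.where(np.isnan(X), pm + im - om, X)
```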
Peer reviewed
van Abswoude, Alexandra A. H.; van der Ark, L. Andries; Sijtsma, Klaas – Applied Psychological Measurement, 2004
In this article, an overview of nonparametric item response theory methods for determining the dimensionality of item response data is provided. Four methods were considered: MSP, DETECT, HCA/CCPROX, and DIMTEST. First, the methods were compared theoretically. Second, a simulation study was done to compare the effectiveness of MSP, DETECT, and…
Descriptors: Comparative Analysis, Computer Software, Simulation, Nonparametric Statistics
Peer reviewed
Bernaards, Coen A.; Sijtsma, Klaas – Multivariate Behavioral Research, 2000
Using simulation, studied the influence of each of 12 imputation methods and 2 methods using the EM algorithm on the results of maximum likelihood factor analysis as compared with results from the complete data factor analysis (no missing scores). Discusses why EM methods recovered complete data factor loadings better than imputation methods. (SLD)
Descriptors: Factor Analysis, Maximum Likelihood Statistics, Questionnaires, Simulation
Peer reviewed
Emons, Wilco H. M.; Meijer, Rob R.; Sijtsma, Klaas – Applied Psychological Measurement, 2002
Studied whether the theoretical sampling distribution of the U3 person-fit statistic is in agreement with the simulated sampling distribution under different item response theory models and varying item and test characteristics. Simulation results suggest that the use of standard normal deviates for the standardized version of the U3 statistic may…
Descriptors: Item Response Theory, Sampling, Simulation, Statistical Distributions
Peer reviewed
Bernaards, Coen A.; Sijtsma, Klaas – Multivariate Behavioral Research, 1999
Used simulation to study the problem of missing item responses in tests and questionnaires when factor analysis is used to study the structure of the items. Factor loadings based on the EM algorithm best approximated the loading structure, with imputation of the mean per person across the scores for that person being the best alternative. (SLD)
Descriptors: Factor Analysis, Factor Structure, Item Response Theory, Simulation
Peer reviewed
Sijtsma, Klaas; Meijer, Rob R. – Psychometrika, 2001
Studied the use of the person response function (PRF) for identifying nonfitting item score patterns. Proposed a person-fit method reformulated in a nonparametric item response theory (IRT) context. Conducted a simulation study to compare the use of the PRF with a person-fit statistic, resulting in the conclusion that the PRF can be used as a…
Descriptors: Item Response Theory, Monte Carlo Methods, Nonparametric Statistics, Scores