Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023
To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…
Descriptors: Models, Item Response Theory, Test Items, Intervals
Bramley, Tom – Research in Mathematics Education, 2017
This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…
Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation
MacDonald, George T. – ProQuest LLC, 2014
A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…
Descriptors: Simulation, Item Response Theory, Models, Test Items
Park, Sangwook – ProQuest LLC, 2011
Many studies have been conducted to evaluate the performance of DIF detection methods when two groups have different ability distributions. Such studies typically have demonstrated factors that are associated with inflation of Type I error rates in DIF detection, such as mean ability differences. However, no study has examined how the direction…
Descriptors: Test Bias, Regression (Statistics), Sample Size, Simulation
Monahan, Patrick – 2000
Previous studies that investigated the effect of unequal ability distributions on the Type I error (TIE) of the Mantel-Haenszel chi-square test for detecting differential item functioning (DIF) simulated ability distributions that differed only in means. This simulation study suggests that the magnitude of TIE inflation is increased, and the type…
Descriptors: Ability, Chi Square, Item Bias, Simulation
Oshima, T. C.; Davey, T. C. – 1994
This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…
Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices
Chang, Shun-Wen; Twu, Bor-Yaun – 2001
To satisfy the security requirements of computerized adaptive tests (CATs), efforts have been made to control the exposure rates of optimal items directly by incorporating statistical methods into the item selection procedure. Since differences are likely to occur between the exposure control parameter derivation stage and the operational CAT…
Descriptors: Adaptive Testing, Computer Assisted Testing, Selection, Simulation
Sotaridona, Leonardo S.; Meijer, Rob R. – 2001
Two new indices to detect answer copying on a multiple-choice test, S(1) and S(2) (subscripts), are proposed. The S(1) index is similar to the K-index (P. Holland, 1996) and the K-overscore(2), (K2) index (L. Sotaridona and R. Meijer, in press), but the distribution of the number of matching incorrect answers of the source (examinee s) and the…
Descriptors: Cheating, Multiple Choice Tests, Responses, Sample Size

Mount, Robert E.; Schumacker, Randall E. – Journal of Outcome Measurement, 1998
A Monte Carlo study was conducted using simulated dichotomous data to determine the effects of guessing on Rasch item fit statistics and the Logit Residual Index. Results indicate that no significant differences were found between the mean Rasch item fit statistics for each distribution type as the probability of guessing the correct answer…
Descriptors: Goodness of Fit, Guessing (Tests), Item Response Theory, Monte Carlo Methods

Kirisci, Levent; Hsu, Tse-Chi – 1995
The main goal of this study was to assess how sensitive unidimensional parameter estimates derived from BILOG were when the unidimensionality assumption was violated and the underlying ability distribution was not multivariate normal. A multidimensional three-parameter logistic distribution that was a straightforward generalization of the…
Descriptors: Ability, Comparative Analysis, Correlation, Difficulty Level
Meijer, Rob R.; van Krimpen-Stoop, Edith M. L. A. – 1998
Several person-fit statistics have been proposed to detect item score patterns that do not fit an item response theory model. To classify response patterns as not fitting a model, a distribution of a person-fit statistic is needed. The null distributions of several fit statistics have been investigated using conventionally administered tests, but…
Descriptors: Ability, Adaptive Testing, Foreign Countries, Item Response Theory
Veldkamp, Bernard P.; van der Linden, Wim J. – 1999
A method of item pool design is proposed that uses an optimal blueprint for the item pool calculated from the test specifications. The blueprint is a document that specifies the attributes that the items in the computerized adaptive test (CAT) pool should have. The blueprint can be a starting point for the item writing process, and it can be used…
Descriptors: Ability, Adaptive Testing, Classification, Computer Assisted Testing

Smith, Richard M. – Educational and Psychological Measurement, 1994
Simulated data are used to assess the appropriateness of using separate calibration and between-fit approaches to detecting item bias in the Rasch rating scale model. Results indicate that Type I error rates for the null distribution hold even when there are different ability levels for reference and focal groups. (SLD)
Descriptors: Ability, Goodness of Fit, Identification, Item Bias
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics

Camilli, Gregory – Applied Psychological Measurement, 1992
A mathematical model is proposed to describe how group differences in distributions of abilities, which are distinct from the target ability, influence the probability of a correct item response. In the multidimensional approach, differential item functioning is considered a function of the educational histories of the examinees. (SLD)
Descriptors: Ability, Comparative Analysis, Equations (Mathematics), Factor Analysis