ERIC - Search Results

Showing 1 to 15 of 17 results

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Some Implications of Choice of Tiering Model in GCSE Mathematics for Inferences about What Students Know and Can Do

Peer reviewed

Bramley, Tom – Research in Mathematics Education, 2017

This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…

Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation

The Performance of the Linear Logistic Test Model When the Q-Matrix Is Misspecified: A Simulation Study

MacDonald, George T. – ProQuest LLC, 2014

A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…

Descriptors: Simulation, Item Response Theory, Models, Test Items

Unexpected Direction of Differential Item Functioning

Park, Sangwook – ProQuest LLC, 2011

Many studies have been conducted to evaluate the performance of DIF detection methods, when two groups have different ability distributions. Such studies typically have demonstrated factors that are associated with inflation of Type I error rates in DIF detection, such as mean ability differences. However, no study has examined how the direction…

Descriptors: Test Bias, Regression (Statistics), Sample Size, Simulation

The Effect of Unequal Variances in the Ability Distributions on the Type I Error Rate of the Mantel-Haenszel Chi-Square Test for Detecting DIF.

Monahan, Patrick – 2000

Previous studies that investigated the effect of unequal ability distributions on the Type I error (TIE) of the Mantel-Haenszel chi-square test for detecting differential item functioning (DIF) simulated ability distributions that differed only in means. This simulation study suggests that the magnitude of TIE inflation is increased, and the type…

Descriptors: Ability, Chi Square, Item Bias, Simulation

Evaluation of Procedures for Linking Multidimensional Item Calibrations.

Oshima, T. C.; Davey, T. C. – 1994

This paper evaluated multidimensional linking procedures with which multidimensional test data from two separate calibrations were put on a common scale. Data were simulated with known ability distributions varying on two factors which made linking necessary: mean vector differences and variance-covariance (v-c) matrix differences. After the…

Descriptors: Ability, Estimation (Mathematics), Evaluation Methods, Matrices

Effects of Changes in the Examinees' Ability Distribution on the Exposure Control Methods in CAT.

Chang, Shun-Wen; Twu, Bor-Yaun – 2001

To satisfy the security requirements of computerized adaptive tests (CATs), efforts have been made to control the exposure rates of optimal items directly by incorporating statistical methods into the item selection procedure. Since differences are likely to occur between the exposure control parameter derivation stage and the operational CAT…

Descriptors: Adaptive Testing, Computer Assisted Testing, Selection, Simulation

Two New Statistics To Detect Answer Copying. Research Report.

Sotaridona, Leonardo S.; Meijer, Rob R. – 2001

Two new indices to detect answer copying on a multiple-choice test, S(1) and S(2) (subscripts), are proposed. The S(1) index is similar to the K-index (P. Holland, 1996) and the K-overscore(2), (K2) index (L. Sotaridona and R. Meijer, in press), but the distribution of the number of matching incorrect answers of the source (examinee s) and the…

Descriptors: Cheating, Multiple Choice Tests, Responses, Sample Size

Identifying Measurement Disturbance Effects Using Rasch Item Fit Statistics and the Logit Residual Index.

Peer reviewed

Mount, Robert E.; Schumacker, Randall E. – Journal of Outcome Measurement, 1998

A Monte Carlo study was conducted using simulated dichotomous data to determine the effects of guessing on Rasch item fit statistics and the Logit Residual Index. Results indicate that no significant differences were found between the mean Rasch item fit statistics for each distribution type as the probability of guessing the correct answer…

Descriptors: Goodness of Fit, Guessing (Tests), Item Response Theory, Monte Carlo Methods

The Robustness of BILOG to Violations of the Assumptions of Unidimensionality of Test Items and Normality of Ability Distribution.

Kirisci, Levent; Hsu, Tse-Chi – 1995

The main goal of this study was to assess how sensitive unidimensional parameter estimates derived from BILOG were when the unidimensionality assumption was violated and the underlying ability distribution was not multivariate normal. A multidimensional three-parameter logistic distribution that was a straightforward generalization of the…

Descriptors: Ability, Comparative Analysis, Correlation, Difficulty Level

Simulating the Null Distribution of Person-Fit Statistics for Conventional and Adaptive Tests. Research Report 98-02.

Meijer, Rob R.; van Krimpen-Stoop, Edith M. L. A. – 1998

Several person-fit statistics have been proposed to detect item score patterns that do not fit an item response theory model. To classify response patterns as not fitting a model, a distribution of a person-fit statistic is needed. The null distributions of several fit statistics have been investigated using conventionally administered tests, but…

Descriptors: Ability, Adaptive Testing, Foreign Countries, Item Response Theory

Designing Item Pools for Computerized Adaptive Testing. Research Report 99-03.

Veldkamp, Bernard P.; van der Linden, Wim J. – 1999

A method of item pool design is proposed that uses an optimal blueprint for the item pool calculated from the test specifications. The blueprint is a document that specifies the attributes that the items in the computerized adaptive test (CAT) pool should have. The blueprint can be a starting point for the item writing process, and it can be used…

Descriptors: Ability, Adaptive Testing, Classification, Computer Assisted Testing

Detecting Item Bias in the Rasch Rating Scale Model.

Peer reviewed

Smith, Richard M. – Educational and Psychological Measurement, 1994

Simulated data are used to assess the appropriateness of using separate calibration and between-fit approaches to detecting item bias in the Rasch rating scale model. Results indicate that Type I error rates for the null distribution hold even when there are different ability levels for reference and focal groups. (SLD)

Descriptors: Ability, Goodness of Fit, Identification, Item Bias

Testing the Robustness of DIMTEST on Nonnormal Ability Distributions.

Nandakumar, Ratna; Yu, Feng – 1994

DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…

Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics

A Conceptual Analysis of Differential Item Functioning in Terms of a Multidimensional Item Response Model.

Peer reviewed

Camilli, Gregory – Applied Psychological Measurement, 1992

A mathematical model is proposed to describe how group differences in distributions of abilities, which are distinct from the target ability, influence the probability of a correct item response. In the multidimensional approach, differential item functioning is considered a function of the educational histories of the examinees. (SLD)

Descriptors: Ability, Comparative Analysis, Equations (Mathematics), Factor Analysis
