Reichle, Erik D.; Drieghe, Denis – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2015
There is an ongoing debate about whether fixation durations during reading are only influenced by the processing difficulty of the words being fixated (i.e., the serial-attention hypothesis) or whether they are also influenced by the processing difficulty of the previous and/or upcoming words (i.e., the attention-gradient hypothesis). This article…
Descriptors: Reading, Eye Movements, Error of Measurement, Difficulty Level
Abrandt Dahlgren, Madeleine; Fenwick, Tara; Hopwood, Nick – Teaching in Higher Education, 2016
Despite the widespread interest in using and researching simulation in higher education, little discussion has yet addressed a key pedagogical concern: difficulty. A "sociomaterial" view of learning, explained in this paper, goes beyond cognitive considerations to highlight dimensions of material, situational, representational and…
Descriptors: Simulation, Higher Education, Social Theories, Experiential Learning
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for the most part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
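
One standard way to turn such comparative judgments into a difficulty scale (named here as an illustration; the report's own scaling procedure may differ) is a Bradley-Terry model, in which the probability that item i is judged harder than item j is

    P(i \succ j) = \frac{e^{d_i}}{e^{d_i} + e^{d_j}},

and maximum-likelihood estimates of the latent difficulties d_i then order the items.
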
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
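
For orientation, the internal restriction that gives the MIRID its name writes a composite item's difficulty as a linear function of its component (subtask) difficulties; in the dichotomous Rasch form introduced by Butter, De Boeck, and Verhelst (1998),

    \beta_i = \sum_k \sigma_k \beta_{ik} + \tau,

with weights \sigma_k and intercept \tau. The extension described above replaces the single Rasch logit with cumulative or adjacent-category logits and adds discrimination parameters.
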
Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011
Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability
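
The classification component in such designs is conventionally Wald's sequential probability ratio test (SPRT); a minimal statement, with the pass/fail cut framed as two ability points \theta_1 < \theta_2 and nominal error rates \alpha and \beta:

    LR_n = \prod_{i=1}^{n} \frac{P_i(x_i \mid \theta_2)}{P_i(x_i \mid \theta_1)}

Classify "pass" once LR_n \ge (1-\beta)/\alpha, "fail" once LR_n \le \beta/(1-\alpha), and otherwise administer another item.
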
Weitzman, R. A. – Educational and Psychological Measurement, 2009
Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Test Bias
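
For reference, the single-item-parameter logistic form at issue is the Rasch model,

    P(x_i = 1 \mid \theta) = \frac{1}{1 + e^{-(\theta - b_i)}},

with difficulty b_i as its only item parameter; the pre-fit adjustment of the 0-1 responses by item-test correlations is what lets this one-parameter form absorb varying discrimination.
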
Dawber, Teresa; Rogers, W. Todd; Carbonaro, Michael – Alberta Journal of Educational Research, 2009
Lord (1980) proposed formulas that provide direct relationships between IRT discrimination and difficulty parameters and conventional item statistics. The purpose of the present study was to determine the robustness of the formulas beyond the initial and restrictive conditions identified by Lord. Simulation and real achievement data were employed.…
Descriptors: Test Items, Simulation, Achievement Tests, Robustness (Statistics)
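
The direct relationships under study are Lord's normal-ogive approximations, one standard statement of which is

    a_i \approx \frac{\rho_i}{\sqrt{1 - \rho_i^2}}, \qquad b_i \approx \frac{\Phi^{-1}(1 - \pi_i)}{\rho_i},

where \rho_i is the item's biserial correlation with total score and \pi_i its proportion correct; the study probes how far these hold once Lord's restrictive conditions are relaxed.
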
Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009
In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…
Descriptors: Test Items, Test Content, Testing Programs, Simulation
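
A minimal sketch of the kind of drift model described, in Python with hypothetical numbers (the study's actual specification conditions on more than position change):

    import numpy as np

    # Hypothetical data: change in Rasch item difficulty (field test -> live)
    # and how many positions each item moved between the two administrations.
    rid_change = np.array([-0.05, 0.02, 0.10, -0.08, 0.15, 0.01])
    pos_change = np.array([-10, 3, 25, -18, 40, 2])

    # Least-squares fit of drift on position change; the slope estimates how
    # many logits harder an item looks per position it moves later in the test.
    slope, intercept = np.polyfit(pos_change, rid_change, 1)
    print(f"estimated drift: {slope:.4f} logits per position")
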
Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1998
Compared three methods for developing a common metric under item response theory through simulation. For smaller numbers of common items, linking using the characteristic curve method yielded smaller root mean square differences for both item discrimination and difficulty parameters. For larger numbers of common items, the three methods were…
Descriptors: Comparative Analysis, Difficulty Level, Item Response Theory, Simulation
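
Of the three methods, the characteristic curve method (often associated with Stocking and Lord) is sketched here: with \theta^* = A\theta + B linking the two metrics, choose A and B to minimize the squared gap between the test characteristic curves of the common items,

    F(A, B) = \sum_q \Big[ \sum_i P_i(\theta_q;\, a_i/A,\, A b_i + B) - \sum_i P_i(\theta_q;\, a_i^*, b_i^*) \Big]^2,

summed over quadrature points \theta_q; moment-based alternatives instead match the means (and standard deviations) of the parameter estimates directly.
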
Mazor, Kathleen M.; And Others – 1991
The Mantel-Haenszel (MH) procedure has become one of the most popular procedures for detecting differential item functioning. Valid results with relatively small numbers of examinees represent one of the advantages typically attributed to this procedure. In this study, examinee item responses were simulated to contain differentially functioning…
Descriptors: Difficulty Level, Item Bias, Item Response Theory, Sample Size
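
For reference, the MH procedure aggregates the 2x2 tables of group by right/wrong response across matched score levels k into the common odds ratio estimator

    \hat{\alpha}_{MH} = \frac{\sum_k R_{Rk} W_{Fk} / N_k}{\sum_k W_{Rk} R_{Fk} / N_k},

where R and W count right and wrong responses for the reference (R) and focal (F) groups and N_k is the number of examinees at level k; ETS reports this on the delta scale as MH D-DIF = -2.35 \ln \hat{\alpha}_{MH}.
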
Lau, C. Allen; Wang, Tianyou – 1999
A study was conducted to extend the sequential probability ratio testing (SPRT) procedure with the polytomous model under some practical constraints in computerized classification testing (CCT), such as methods to control item exposure rate, and to study the effects of other variables, including item information algorithms, test difficulties, item…
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks
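
A minimal Python sketch of the SPRT loop being extended (dichotomous Rasch version for brevity; the cut points are illustrative, and the item-exposure controls studied in the paper are omitted):

    import numpy as np

    def rasch_p(theta, b):
        """Probability of a correct response under the Rasch model."""
        return 1.0 / (1.0 + np.exp(-(theta - b)))

    def sprt_classify(responses, difficulties, theta_fail=-0.5, theta_pass=0.5,
                      alpha=0.05, beta=0.05):
        """Wald SPRT over scored items (1 = correct, 0 = incorrect).

        A polytomous extension replaces rasch_p with category probabilities
        from, e.g., a partial credit model.
        """
        upper = np.log((1 - beta) / alpha)   # cross it: classify "pass"
        lower = np.log(beta / (1 - alpha))   # cross it: classify "fail"
        llr = 0.0
        for x, b in zip(responses, difficulties):
            p_hi, p_lo = rasch_p(theta_pass, b), rasch_p(theta_fail, b)
            llr += np.log(p_hi / p_lo) if x else np.log((1 - p_hi) / (1 - p_lo))
            if llr >= upper:
                return "pass"
            if llr <= lower:
                return "fail"
        return "continue"  # undecided: administer another item

    print(sprt_classify([1, 1, 0, 1, 1, 1], [0.2, -0.1, 0.4, 0.0, 0.3, -0.2]))
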
Wang, Wen-chung; Wilson, Mark; Adams, Raymond J. – Journal of Outcome Measurement, 1998
Another Rasch approach to the measurement of change, the multidimensional random coefficient multinomial logit model (MRCML), is proposed. The MRCML model can be applied to polytomous items and the investigation of variations in item difficulties. Some simulation studies demonstrate good parameter recovery for the MRCML model under various testing…
Descriptors: Change, Difficulty Level, Individual Differences, Item Response Theory
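
For reference, the MRCML response model of Adams, Wilson, and Wang (1997) gives category k of item i the probability

    P(X_{ik} = 1 \mid \boldsymbol{\theta}) = \frac{\exp(\mathbf{b}_{ik}'\boldsymbol{\theta} + \mathbf{a}_{ik}'\boldsymbol{\xi})}{\sum_{k'} \exp(\mathbf{b}_{ik'}'\boldsymbol{\theta} + \mathbf{a}_{ik'}'\boldsymbol{\xi})},

where the score vectors \mathbf{b}_{ik} map the latent dimensions \boldsymbol{\theta} onto the categories and the design vectors \mathbf{a}_{ik} pick out the item parameters \boldsymbol{\xi}; change can then be measured by treating measurement occasions as separate dimensions.
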
Clauser, Brian E.; And Others – 1991
Item bias has been a major concern for test developers during recent years. The Mantel-Haenszel statistic has been among the preferred methods for identifying biased items. The statistic's performance in identifying uniform bias in simulated data modeled by producing various levels of difference in the (item difficulty) b-parameter for reference…
Descriptors: Comparative Testing, Difficulty Level, Item Bias, Item Response Theory
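
A minimal Python sketch of how uniform bias is induced in simulations of this kind (the 0.5-logit shift and sample sizes are illustrative; the study crossed several levels of b-parameter difference):

    import numpy as np

    rng = np.random.default_rng(0)

    def simulate_responses(thetas, b, shift=0.0):
        """Rasch responses; a positive shift makes the item uniformly harder."""
        p = 1.0 / (1.0 + np.exp(-(thetas - (b + shift))))
        return (rng.random(thetas.size) < p).astype(int)

    # The reference group answers the unshifted item; the focal group sees the
    # same item made 0.5 logits harder, i.e. the uniform-DIF condition.
    theta_ref = rng.normal(0.0, 1.0, 1000)
    theta_foc = rng.normal(0.0, 1.0, 1000)
    x_ref = simulate_responses(theta_ref, b=0.0)
    x_foc = simulate_responses(theta_foc, b=0.0, shift=0.5)
    print(x_ref.mean(), x_foc.mean())  # focal proportion correct drops
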
Dodeen, Hamzeh – Journal of Educational Measurement, 2004
The effect of item parameters (discrimination, difficulty, and level of guessing) on the item-fit statistic was investigated using simulated dichotomous data. Nine tests were simulated using 1,000 persons, 50 items, three levels of item discrimination, three levels of item difficulty, and three levels of guessing. The item fit was estimated using…
Descriptors: Item Response Theory, Difficulty Level, Test Items, Guessing (Tests)
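
A Python sketch of the data-generating step such a design rests on, using the abstract's dimensions (1,000 persons, 50 items, three levels of each parameter; the specific level values and the 1.7 scaling constant are illustrative):

    import numpy as np

    rng = np.random.default_rng(1)

    def p_3pl(theta, a, b, c):
        """Three-parameter logistic: discrimination a, difficulty b, guessing c."""
        return c + (1.0 - c) / (1.0 + np.exp(-1.7 * a * (theta - b)))

    theta = rng.normal(size=1000)                # 1,000 simulated persons
    a = rng.choice([0.5, 1.0, 1.5], size=50)     # three discrimination levels
    b = rng.choice([-1.0, 0.0, 1.0], size=50)    # three difficulty levels
    c = rng.choice([0.0, 0.15, 0.3], size=50)    # three guessing levels
    p = p_3pl(theta[:, None], a, b, c)           # 1000 x 50 probabilities
    x = (rng.random(p.shape) < p).astype(int)    # dichotomous responses
    # An item-fit statistic is then computed on x, item by item.
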
Gershon, Richard; Bergstrom, Betty – 1995
When examinees are allowed to review responses on an adaptive test, can they "cheat" the adaptive algorithm in order to take an easier test and improve their performance? Theoretically, deliberately answering items incorrectly will lower the examinee ability estimate and easy test items will be administered. If review is then allowed,…
Descriptors: Adaptive Testing, Algorithms, Cheating, Computer Assisted Testing
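
A toy Python sketch of the exploit's logic (the fixed 0.3-logit decrement stands in for a real ability update; everything here is illustrative):

    import numpy as np

    rng = np.random.default_rng(2)
    bank = np.sort(rng.normal(0.0, 1.0, 200))  # item difficulties in logits
    theta_hat, administered = 0.0, []
    for _ in range(30):
        remaining = np.setdiff1d(bank, administered)
        item = remaining[np.argmin(np.abs(remaining - theta_hat))]  # max-info pick
        administered.append(item)
        theta_hat -= 0.3  # each deliberate wrong answer lowers the estimate
    print(f"mean difficulty administered: {np.mean(administered):.2f} logits")
    # At review, switching these deliberately missed, easy items to correct
    # answers yields a raw score the scoring model maps to an inflated ability.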