Publication Date
In 2025 | 0
Since 2024 | 1
Since 2021 (last 5 years) | 2
Since 2016 (last 10 years) | 4
Since 2006 (last 20 years) | 54
Author
Nandakumar, Ratna | 10
Zwick, Rebecca | 9
Stocking, Martha L. | 7
Chang, Hua-Hua | 6
Cohen, Allan S. | 6
Kim, Seock-Ho | 6
Hambleton, Ronald K. | 5
Meijer, Rob R. | 5
Oshima, T. C. | 5
Penfield, Randall D. | 5
Smith, Richard M. | 5
Publication Type
Reports - Evaluative | 247
Journal Articles | 146
Speeches/Meeting Papers | 70
Reports - Research | 3
Numerical/Quantitative Data | 2
Guides - Non-Classroom | 1
Information Analyses | 1
Reports - Descriptive | 1
Education Level
Elementary Secondary Education | 3
Higher Education | 3
Postsecondary Education | 2
Elementary Education | 1
Grade 6 | 1
Grade 7 | 1
Grade 8 | 1
Middle Schools | 1
Audience
Administrators | 1
Practitioners | 1
Researchers | 1
Teachers | 1
Location
Netherlands | 2
Taiwan | 2
United States | 2
Canada | 1
Japan | 1
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1
Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024
This research aims to compare the ability and item parameter estimates of item response theory under maximum likelihood and Bayesian approaches across different Monte Carlo simulation conditions. For this purpose, depending on changes in the prior distribution type, sample size, test length, and logistic model, the ability and item…
Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation
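As a rough illustration of the contrast this abstract describes, the sketch below estimates one examinee's ability under a 2PL model by maximum likelihood and by Bayesian EAP with a standard normal prior. The item parameters and response pattern are invented and do not reflect the study's simulation design.

```python
# Minimal sketch: ML vs. Bayesian (EAP) ability estimation under a 2PL model.
# Item parameters, responses, and the N(0, 1) prior are illustrative assumptions.
import numpy as np
from scipy.optimize import minimize_scalar

a = np.array([1.2, 0.8, 1.5, 1.0, 0.6])   # discriminations (assumed)
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])  # difficulties (assumed)
y = np.array([1, 1, 1, 0, 0])              # one examinee's responses

def neg_loglik(theta):
    pr = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return -np.sum(y * np.log(pr) + (1 - y) * np.log(1 - pr))

# Maximum likelihood: maximize the likelihood over theta alone.
theta_ml = minimize_scalar(neg_loglik, bounds=(-4, 4), method="bounded").x

# Bayesian EAP: posterior mean under a standard normal prior, by quadrature.
nodes = np.linspace(-4, 4, 161)
prior = np.exp(-0.5 * nodes**2)
like = np.array([np.exp(-neg_loglik(t)) for t in nodes])
post = prior * like
theta_eap = np.sum(nodes * post) / np.sum(post)

print(f"ML estimate:  {theta_ml:.3f}")
print(f"EAP estimate: {theta_eap:.3f}")  # shrunk toward the prior mean of 0
```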
Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023
Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…
Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items
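For readers unfamiliar with the scoring problem, the snippet below contrasts two common composites when item-level responses are missing: a complete-case sum versus a prorated person mean. The four-item data are made up, and the paper itself concerns more principled model-based treatments.

```python
# Minimal sketch: two naive ways to form a composite with item-level missingness.
import numpy as np

X = np.array([[4.0, 3.0, np.nan, 5.0],
              [2.0, 2.0, 3.0,    2.0],
              [np.nan, 4.0, 4.0, np.nan]])

# Complete-case composite: undefined for anyone missing any item.
complete = np.where(np.isnan(X).any(axis=1), np.nan, X.sum(axis=1))

# Prorated composite: person mean of observed items, rescaled to the sum metric.
prorated = np.nanmean(X, axis=1) * X.shape[1]

print(complete)   # [nan  9. nan]
print(prorated)   # [16.  9. 16.]
```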
Kim, Kyung Yong – Journal of Educational Measurement, 2020
New items are often evaluated prior to their operational use to obtain item response theory (IRT) item parameter estimates for quality control purposes. Fixed parameter calibration is one linking method that is widely used to estimate parameters for new items and place them on the desired scale. This article provides detailed descriptions of two…
Descriptors: Item Response Theory, Evaluation Methods, Test Items, Simulation
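The sketch below conveys the gist of fixing anchor parameters while calibrating a new item, using a simplified two-stage shortcut: EAP scoring on the fixed anchors, then a curve fit for the new item. Actual fixed parameter calibration works inside the full marginal likelihood (e.g., via EM), so treat this only as intuition; all numbers are invented.

```python
# Minimal sketch of the fixed-parameter idea: anchor parameters stay fixed,
# only the new item is estimated, landing it on the anchors' scale.
import numpy as np
from scipy.optimize import curve_fit

rng = np.random.default_rng(1)
n = 2000
theta = rng.normal(size=n)

# Fixed (operational) anchor items, already on the established scale.
a_anc = np.array([1.0, 1.3, 0.8, 1.1])
b_anc = np.array([-0.8, 0.0, 0.4, 1.0])
y_anc = rng.uniform(size=(n, 4)) < 1 / (1 + np.exp(-a_anc * (theta[:, None] - b_anc)))

# One new item to calibrate (truth: a = 1.2, b = 0.3).
y_new = (rng.uniform(size=n) < 1 / (1 + np.exp(-1.2 * (theta - 0.3)))).astype(float)

# Stage 1: EAP ability estimates using only the fixed anchor parameters.
nodes = np.linspace(-4, 4, 81)
p_q = 1 / (1 + np.exp(-a_anc * (nodes[:, None] - b_anc)))            # (81, 4)
ll = y_anc[:, None, :] * np.log(p_q) + (~y_anc[:, None, :]) * np.log(1 - p_q)
post = np.exp(ll.sum(axis=2)) * np.exp(-0.5 * nodes**2)              # (n, 81)
theta_hat = (post * nodes).sum(axis=1) / post.sum(axis=1)

# Stage 2: fit the new item's 2PL curve against the anchored abilities.
def icc(t, a, b):
    return 1 / (1 + np.exp(-a * (t - b)))

(a_new, b_new), _ = curve_fit(icc, theta_hat, y_new, p0=[1.0, 0.0])
print(f"new item: a = {a_new:.2f}, b = {b_new:.2f}")
```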
Drabinová, Adéla; Martinková, Patrícia – Journal of Educational Measurement, 2017
In this article we present a general approach that does not rely on item response theory models (non-IRT) to detect differential item functioning (DIF) in dichotomous items in the presence of guessing. The proposed nonlinear regression (NLR) procedure for DIF detection is an extension of a method based on logistic regression. As a non-IRT approach, NLR can…
Descriptors: Test Items, Regression (Statistics), Guessing (Tests), Identification
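The following sketch fits a nonlinear regression of the same general shape as the NLR method: a guessing floor plus a logistic term in the matching score, group, and their interaction, with an approximate F-test for the two group terms. The data, starting values, and test are illustrative assumptions, not the published procedure.

```python
# Minimal sketch of DIF detection via nonlinear regression with a guessing floor.
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import f as f_dist

rng = np.random.default_rng(7)
n = 3000
score = rng.normal(size=n)                 # matching criterion (standardized)
group = rng.integers(0, 2, size=n)         # 0 = reference, 1 = focal

def nlr(X, c, b0, b1, b2, b3):
    x, g = X
    return c + (1 - c) / (1 + np.exp(-(b0 + b1 * x + b2 * g + b3 * x * g)))

# Simulate an item with guessing (c = .2) and uniform DIF against the focal group.
p_true = nlr((score, group), 0.2, 0.0, 1.2, -0.6, 0.0)
y = (rng.uniform(size=n) < p_true).astype(float)

full, _ = curve_fit(nlr, (score, group), y, p0=[0.1, 0, 1, 0, 0],
                    bounds=([0, -5, 0, -5, -5], [0.5, 5, 5, 5, 5]))
null, _ = curve_fit(lambda X, c, b0, b1: nlr(X, c, b0, b1, 0, 0),
                    (score, group), y, p0=[0.1, 0, 1],
                    bounds=([0, -5, 0], [0.5, 5, 5]))

rss_full = np.sum((y - nlr((score, group), *full)) ** 2)
rss_null = np.sum((y - nlr((score, group), *null, 0, 0)) ** 2)

# Approximate F-test for the two group terms (uniform + nonuniform DIF).
F = ((rss_null - rss_full) / 2) / (rss_full / (n - 5))
print(f"F = {F:.2f}, p = {1 - f_dist.cdf(F, 2, n - 5):.4f}")
```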
Attali, Yigal; Saldivia, Luis; Jackson, Carol; Schuppan, Fred; Wanamaker, Wilbur – ETS Research Report Series, 2014
Previous investigations of the ability of content experts and test developers to estimate item difficulty have, for the most part, produced disappointing results. These investigations were based on a noncomparative method of independently rating the difficulty of items. In this article, we argue that, by eliciting comparative judgments of…
Descriptors: Test Items, Difficulty Level, Comparative Analysis, College Entrance Examinations
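To make the comparative-judgment idea concrete, the sketch below converts pairwise "which item is harder?" counts into a difficulty scale with a Bradley-Terry model fit by the Zermelo fixed-point iteration. The win matrix is invented, and the report's own scaling procedure may differ.

```python
# Minimal sketch: pairwise difficulty judgments -> Bradley-Terry scale values.
import numpy as np

# wins[i, j] = number of judges who rated item i harder than item j.
wins = np.array([[0, 8, 9],
                 [2, 0, 7],
                 [1, 3, 0]], dtype=float)
n_items = wins.shape[0]
strength = np.ones(n_items)

# Zermelo/MM fixed-point iteration for Bradley-Terry strengths.
for _ in range(200):
    total_wins = wins.sum(axis=1)
    pair_games = wins + wins.T
    denom = (pair_games / (strength[:, None] + strength[None, :])).sum(axis=1)
    strength = total_wins / denom
    strength /= strength.sum()

difficulty = np.log(strength)              # log-strengths as a difficulty scale
print(np.round(difficulty - difficulty.mean(), 2))
```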
de la Torre, Jimmy; Lee, Young-Sun – Journal of Educational Measurement, 2013
This article used the Wald test to evaluate the item-level fit of a saturated cognitive diagnosis model (CDM) relative to the fits of the reduced models it subsumes. A simulation study was carried out to examine the Type I error and power of the Wald test in the context of the G-DINA model. Results show that when the sample size is small and a…
Descriptors: Statistical Analysis, Test Items, Goodness of Fit, Error of Measurement
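A generic version of the test statistic involved is easy to state: W = (Rb)'[RVR']^{-1}(Rb), referred to a chi-square with rank(R) degrees of freedom. The sketch below computes it for made-up saturated-model estimates and restrictions; it is not G-DINA output.

```python
# Minimal sketch of the Wald statistic used for item-level model comparisons.
import numpy as np
from scipy.stats import chi2

b = np.array([0.18, 0.42, 0.39, 0.05])     # saturated item parameter estimates (assumed)
V = np.diag([0.002, 0.003, 0.003, 0.004])  # their estimated covariance matrix (assumed)

# Restrictions a reduced model imposes on the saturated parameters,
# e.g., equal main effects and no interaction: b2 - b3 = 0 and b4 = 0.
R = np.array([[0.0, 1.0, -1.0, 0.0],
              [0.0, 0.0, 0.0, 1.0]])

Rb = R @ b
W = Rb @ np.linalg.solve(R @ V @ R.T, Rb)
p = chi2.sf(W, df=R.shape[0])
print(f"W = {W:.2f}, df = {R.shape[0]}, p = {p:.4f}")
```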
Ranger, Jochen; Kuhn, Jorg-Tobias – Journal of Educational Measurement, 2012
The information matrix can equivalently be determined via the expectation of the Hessian matrix or the expectation of the outer product of the score vector. The identity of these two matrices, however, is only valid in the case of a correctly specified model. Therefore, differences between the two versions of the observed information matrix indicate…
Descriptors: Goodness of Fit, Item Response Theory, Models, Matrices
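The two versions of the information matrix are concrete for a simple logistic model: the sketch below computes the Hessian form X'diag(p(1-p))X and the outer product of scores on correctly specified simulated data, where the two should roughly agree. The setup is illustrative, not the authors' actual fit test.

```python
# Minimal sketch: Hessian-based vs. outer-product-of-scores information matrices.
import numpy as np

rng = np.random.default_rng(0)
n = 5000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta = np.array([-0.3, 0.8])
p = 1 / (1 + np.exp(-X @ beta))
y = (rng.uniform(size=n) < p).astype(float)

# (Evaluated at the true beta for brevity; in practice use the MLE.)
I_hessian = X.T @ (X * (p * (1 - p))[:, None])   # negative Hessian form
scores = X * (y - p)[:, None]
I_opg = scores.T @ scores                         # outer product of score vectors

print(np.round(I_hessian, 1))
print(np.round(I_opg, 1))   # similar when the model is correctly specified
```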
Debelak, Rudolf; Arendasy, Martin – Educational and Psychological Measurement, 2012
A new approach to identify item clusters fitting the Rasch model is described and evaluated using simulated and real data. The proposed method is based on hierarchical cluster analysis and constructs clusters of items that show a good fit to the Rasch model. It thus gives an estimate of the number of independent scales satisfying the postulates of…
Descriptors: Test Items, Factor Analysis, Evaluation Methods, Simulation
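The clustering step can be sketched with standard tools: the snippet below agglomerates six simulated items (three per latent trait) using correlation distance and cuts the tree into two clusters. The published method grows clusters by Rasch model fit rather than raw correlation, so this is only a stand-in.

```python
# Minimal sketch: hierarchical clustering of items into candidate scales.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform

rng = np.random.default_rng(3)
n = 1000
t1, t2 = rng.normal(size=(2, n))               # two independent latent traits
b = np.tile([-1.0, 0.0, 1.0], 2)               # item difficulties
theta = np.column_stack([t1, t1, t1, t2, t2, t2])
y = (rng.uniform(size=(n, 6)) < 1 / (1 + np.exp(-(theta - b)))).astype(float)

dist = 1 - np.corrcoef(y.T)                    # item dissimilarity (stand-in)
np.fill_diagonal(dist, 0.0)
Z = linkage(squareform(dist, checks=False), method="average")
print(fcluster(Z, t=2, criterion="maxclust"))  # e.g. [1 1 1 2 2 2]
```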
Yao, Lihua – Applied Psychological Measurement, 2013
Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
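A unidimensional caricature of the stopping-rule question is sketched below: administer the most informative remaining 2PL item until the standard error of the ability estimate drops below a threshold. The multidimensional machinery, exposure control, and content constraints of the study are all omitted, and the item bank is invented.

```python
# Minimal sketch: maximum-information CAT with a standard-error stopping rule.
import numpy as np

rng = np.random.default_rng(5)
n_items = 200
a = rng.uniform(0.8, 2.0, size=n_items)   # bank discriminations (assumed)
b = rng.normal(size=n_items)              # bank difficulties (assumed)
theta_true, theta_hat = 0.7, 0.0
used, resp = [], []

while True:
    p_all = 1 / (1 + np.exp(-a * (theta_hat - b)))
    info = a**2 * p_all * (1 - p_all)
    info[used] = -np.inf                  # never reuse an item
    j = int(np.argmax(info))              # maximum-information selection
    used.append(j)
    resp.append(rng.uniform() < 1 / (1 + np.exp(-a[j] * (theta_true - b[j]))))
    # Refresh the ML ability estimate with a few (clipped) Newton-Raphson steps.
    for _ in range(10):
        p = 1 / (1 + np.exp(-a[used] * (theta_hat - b[used])))
        step = np.sum(a[used] * (np.array(resp) - p)) / np.sum(a[used]**2 * p * (1 - p))
        theta_hat = np.clip(theta_hat + step, -4, 4)
    p = 1 / (1 + np.exp(-a[used] * (theta_hat - b[used])))
    se = 1 / np.sqrt(np.sum(a[used]**2 * p * (1 - p)))
    if se < 0.30 or len(used) == n_items:  # standard-error stopping rule
        break

print(f"items used: {len(used)}, theta_hat = {theta_hat:.2f} (SE = {se:.3f})")
```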
Suh, Youngsuk; Bolt, Daniel M. – Journal of Educational Measurement, 2011
In multiple-choice items, differential item functioning (DIF) in the correct response may or may not be caused by differentially functioning distractors. Identifying distractors as causes of DIF can provide valuable information for potential item revision or the design of new test items. In this paper, we examine a two-step approach based on…
Descriptors: Test Items, Test Bias, Multiple Choice Tests, Simulation
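The sketch below gives a simplified stand-in for distractor-level DIF screening: among simulated examinees who missed the item, choice of each distractor is regressed on the matching score and group, and the group coefficient flags a differentially attractive distractor. The paper's actual two-step procedure is model-based, so the data and test here are assumptions.

```python
# Minimal sketch: screening distractors for group effects among wrong answers.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(11)
n = 4000                                   # simulated incorrect responders
score = rng.normal(size=n)                 # matching criterion
group = rng.integers(0, 2, size=n)         # 0 = reference, 1 = focal

# Simulate choice among distractors A/B/C: B is more attractive to the focal group.
logits = np.column_stack([0.2 * score, -0.5 + 0.8 * group, 0.1 * np.ones(n)])
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
choice = (probs.cumsum(axis=1) > rng.uniform(size=(n, 1))).argmax(axis=1)

X = np.column_stack([score, group])
for k, label in enumerate("ABC"):
    m = LogisticRegression().fit(X, (choice == k).astype(int))
    print(f"distractor {label}: group coefficient = {m.coef_[0][1]:+.2f}")
```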
Gan, Zhengdong – Changing English: Studies in Culture and Education, 2012
Leung and Lewkowicz remind us that the debate over the past two decades that is most relevant to ELT (English language teaching) pedagogy and curriculum concerns test-task authenticity. This paper first reviews how the authenticity debate in the literature of second language acquisition, pedagogy and testing has evolved. Drawing on a body of…
Descriptors: Teaching Methods, English (Second Language), Second Language Learning, Second Language Instruction
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
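For readers unfamiliar with the link function, the snippet below evaluates adjacent-category logits for one four-category item: each adjacent logit is theta minus a step parameter, so category probabilities are proportional to exponentiated partial sums. The parameter values are illustrative, and the full MIRID structure is omitted.

```python
# Minimal sketch: category probabilities from adjacent-category logits,
# i.e., log(P(X = k) / P(X = k - 1)) = theta - d_k.
import numpy as np

theta = 0.5
d = np.array([-1.0, 0.2, 1.1])             # step parameters, 4-category item (assumed)
cum = np.concatenate([[0.0], np.cumsum(theta - d)])
probs = np.exp(cum) / np.exp(cum).sum()
print(np.round(probs, 3))                  # probabilities of categories 0..3
```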
Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011
Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability
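The classification logic can be sketched with a sequential probability ratio test, one standard engine for CCT: accumulate the log likelihood ratio between two ability points bracketing the cut score and stop at Wald's thresholds. Item parameters, cut points, and error rates below are illustrative assumptions.

```python
# Minimal sketch: SPRT-style pass/fail classification over a 2PL item pool.
import numpy as np

rng = np.random.default_rng(2)
a = rng.uniform(0.8, 1.8, size=50)
b = rng.normal(size=50)
theta_fail, theta_pass = -0.3, 0.3        # indifference region around the cut
alpha = beta = 0.05
upper, lower = np.log((1 - beta) / alpha), np.log(beta / (1 - alpha))

def p(th):
    return 1 / (1 + np.exp(-a * (th - b)))

theta_true = 0.8
llr = 0.0
for j in range(50):
    y = rng.uniform() < p(theta_true)[j]
    pp, pf = p(theta_pass)[j], p(theta_fail)[j]
    llr += np.log(pp / pf) if y else np.log((1 - pp) / (1 - pf))
    if llr >= upper:
        print(f"pass after {j + 1} items")
        break
    if llr <= lower:
        print(f"fail after {j + 1} items")
        break
else:
    print("no decision; fall back to a point-estimate rule")
```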
Carvajal, Jorge; Skorupski, William P. – Educational and Psychological Measurement, 2010
This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…
Descriptors: Test Bias, Sample Size, Test Items, Computation
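The sketch below computes a cumulative common odds ratio in the spirit of the Liu-Agresti estimator: dichotomize the polytomous score at every cut point within each matching stratum and pool the resulting 2x2 tables Mantel-Haenszel-style. The data are simulated, and the exact published formulation should be taken from Penfield and Algina.

```python
# Minimal sketch: pooled cumulative odds ratio for polytomous DIF.
import numpy as np

rng = np.random.default_rng(4)
n = 1500
group = rng.integers(0, 2, size=n)                  # 0 = reference, 1 = focal
stratum = rng.integers(0, 5, size=n)                # matching-score strata
# Ordinal item scores 0-3, slightly lower for the focal group (uniform DIF).
latent = rng.normal(size=n) + 0.4 * stratum - 0.3 * group
score = np.digitize(latent, [0.0, 1.0, 2.0])

num = den = 0.0
for k in np.unique(stratum):
    for cut in range(score.max()):
        ink = stratum == k
        hi = score > cut
        A = np.sum(ink & (group == 0) & hi)          # reference, above cut
        B = np.sum(ink & (group == 0) & ~hi)
        C = np.sum(ink & (group == 1) & hi)          # focal, above cut
        D = np.sum(ink & (group == 1) & ~hi)
        N = A + B + C + D
        num += A * D / N
        den += B * C / N

print(f"cumulative common odds ratio ~ {num / den:.2f}")  # > 1: favors reference
```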
Murphy, Daniel L.; Dodd, Barbara G.; Vaughn, Brandon K. – Applied Psychological Measurement, 2010
This study examined the performance of the maximum Fisher's information, the maximum posterior weighted information, and the minimum expected posterior variance methods for selecting items in a computerized adaptive testing system when the items were grouped in testlets. A simulation study compared the efficiency of ability estimation among the…
Descriptors: Simulation, Adaptive Testing, Item Analysis, Item Response Theory
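Two of the selection criteria compared above differ only in how they weight information: the sketch below evaluates, for two invented 2PL items, Fisher information at the current point estimate versus information averaged over the current posterior. Testlet structure and the third criterion (minimum expected posterior variance) are omitted.

```python
# Minimal sketch: maximum Fisher information vs. posterior-weighted information.
import numpy as np

a = np.array([1.6, 1.0])                   # two candidate items (assumed)
b = np.array([1.2, 0.2])

nodes = np.linspace(-4, 4, 161)
posterior = np.exp(-0.5 * ((nodes - 0.1) / 0.8) ** 2)  # current posterior, say N(0.1, 0.64)
posterior /= posterior.sum()
theta_hat = np.sum(nodes * posterior)      # point estimate (posterior mean)

def info(theta):
    p = 1 / (1 + np.exp(-a * (theta - b)))
    return a**2 * p * (1 - p)

mfi = info(theta_hat)                                        # info at the point estimate
mpwi = (info(nodes[:, None]) * posterior[:, None]).sum(axis=0)  # info averaged over posterior

print("MFI :", np.round(mfi, 3))
print("MPWI:", np.round(mpwi, 3))  # averaging over uncertainty can change the pick
```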