ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	22

Descriptor

Test Items	32
Simulation	29
Item Response Theory	13
Computer Assisted Testing	12
Item Analysis	9
Adaptive Testing	8
Models	7
Test Construction	6
Evaluation Methods	5
Ability	4
Comparative Analysis	4
Computation	4
Error of Measurement	4
Goodness of Fit	4
Scoring	4
Classification	3
Computer Simulation	3
Computer Software	3
Data Analysis	3
Difficulty Level	3
Foreign Countries	3
Maximum Likelihood Statistics	3
Probability	3
Scores	3
Selection	3
More ▼

Source

Applied Psychological…	5
Journal of Educational and…	5
Educational Measurement:…	2
Educational and Psychological…	2
Journal of Educational…	2
Psychometrika	2
Turkish Online Journal of…	2
Assessment & Evaluation in…	1
Harvard Education Press	1
IEEE Transactions on Learning…	1
International Journal of…	1
Journal of Applied Measurement	1
Journal of Legal Education	1
Mathematics Teacher	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
Perspectives in Education	1
Practical Assessment,…	1
More ▼

Publication Type

Reports - Descriptive	32
Journal Articles	30
Collected Works - Serial	1
Reports - Evaluative	1

Education Level

Higher Education	3
Secondary Education	2
Elementary Secondary Education	1
Grade 10	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Postsecondary Education	1
More ▼

Audience

Researchers

Location

Massachusetts	1
Oregon	1
Philippines	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed
PDF on ERIC

Download full text

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Digital Module 13: Monte Carlo Simulation Studies in Item Response Theory

Peer reviewed

Direct link

Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…

Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items

Item Order and Speededness: Implications for Test Fairness in Higher Educational High-Stakes Testing

Peer reviewed

Direct link

Becker, Benjamin; van Rijn, Peter; Molenaar, Dylan; Debeer, Dries – Assessment & Evaluation in Higher Education, 2022

A common approach to increase test security in higher educational high-stakes testing is the use of different test forms with identical items but different item orders. The effects of such varied item orders are relatively well studied, but findings have generally been mixed. When multiple test forms with different item orders are used, we argue…

Descriptors: Information Security, High Stakes Tests, Computer Security, Test Items

Digital Module 08: Foundations of Operational Item Analysis https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Yoo, Hanwook; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2019

Item analysis is an integral part of operational test development and is typically conducted within two popular statistical frameworks: classical test theory (CTT) and item response theory (IRT). In this digital ITEMS module, Hanwook Yoo and Ronald K. Hambleton provide an accessible overview of operational item analysis approaches within these…

Descriptors: Item Analysis, Item Response Theory, Guidelines, Test Construction

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Direct link

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Direct link

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

Peer reviewed
PDF on ERIC

Download full text

Aybek, Eren Can; Demirtasli, R. Nukhet – International Journal of Research in Education and Science, 2017

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

A Maximum Likelihood Based Offline Estimation of Student Capabilities and Question Difficulties with Guessing

Peer reviewed

Direct link

Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016

In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…

Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics

Engaging Students in Survey Design and Data Collection

Peer reviewed

Direct link

Sole, Marla A. – Mathematics Teacher, 2015

Every day, people use data to make decisions that affect their personal and professional lives, trusting that the data are correct. Many times, however, the data are inaccurate, as a result of a flaw in the design or methodology of the survey used to collect the data. Researchers agree that only questions that are clearly worded, unambiguous, free…

Descriptors: Test Construction, Surveys, Student Participation, Design

Screening Test Items for Differential Item Functioning

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014

A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…

Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing

A Randomized Rounding Approach for Optimization of Test Sheet Composing and Exposure Rate Control in Computer-Assisted Testing

Peer reviewed
PDF on ERIC

Download full text

Wang, Chu-Fu; Lin, Chih-Lung; Deng, Jien-Han – Turkish Online Journal of Educational Technology - TOJET, 2012

Testing is an important stage of teaching as it can assist teachers in auditing students' learning results. A good test is able to accurately reflect the capability of a learner. Nowadays, Computer-Assisted Testing (CAT) is greatly improving traditional testing, since computers can automatically and quickly compose a proper test sheet to meet user…

Descriptors: Simulation, Test Items, Student Evaluation, Test Construction

Robust Estimation of Latent Ability in Item Response Models

Peer reviewed

Direct link

Schuster, Christof; Yuan, Ke-Hai – Journal of Educational and Behavioral Statistics, 2011

Because of response disturbances such as guessing, cheating, or carelessness, item response models often can only approximate the "true" individual response probabilities. As a consequence, maximum-likelihood estimates of ability will be biased. Typically, the nature and extent to which response disturbances are present is unknown, and, therefore,…

Descriptors: Computation, Item Response Theory, Probability, Maximum Likelihood Statistics

Multidimensional Item Response Theory Parameter Estimation with Nonsimple Structure Items

Peer reviewed

Direct link

Finch, Holmes – Applied Psychological Measurement, 2011

Estimation of multidimensional item response theory (MIRT) model parameters can be carried out using the normal ogive with unweighted least squares estimation with the normal-ogive harmonic analysis robust method (NOHARM) software. Previous simulation research has demonstrated that this approach does yield accurate and efficient estimates of item…

Descriptors: Item Response Theory, Computation, Test Items, Simulation

An Odds Ratio Approach for Assessing Differential Distractor Functioning Effects under the Nominal Response Model

Peer reviewed

Direct link

Penfield, Randall D. – Journal of Educational Measurement, 2008

Investigations of differential distractor functioning (DDF) can provide valuable information concerning the location and possible causes of measurement invariance within a multiple-choice item. In this article, I propose an odds ratio estimator of the DDF effect as modeled under the nominal response model. In addition, I propose a simultaneous…

Descriptors: Test Items, Investigations, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Ames, Allison	1
Armstrong, Ronald D.	1
Aybek, Eren Can	1
Becker, Benjamin	1
Belov, Dmitry I.	1
Belur, Madhu N.	1
Beretvas, S. Natasha	1
Bolsinova, Maria	1
Bradlow, Eric T.	1
Brusco, Michael J.	1
Ceulemans, Eva	1
Chang, Hua-Hua	1
Chaporkar, Prasanna	1
Chauncey, Caroline T., Ed.	1
Debeer, Dries	1
Demirtasli, R. Nukhet	1
Deng, Jien-Han	1
Finch, Holmes	1
Fuchimoto, Kazuma	1
Hambleton, Ronald K.	1
Huang, Sheng-Yun	1
Huitzing, Hiddo A.	1
Ishii, Takatoshi	1
Keller, Lisa A.	1
Kovach, Kimberlee K.	1
More ▼