Ziying Li; A. Corinne Huggins-Manley; Walter L. Leite; M. David Miller; Eric A. Wright – Educational and Psychological Measurement, 2022
The unstructured multiple-attempt (MA) item response data in virtual learning environments (VLEs) often come from student-selected assessment data sets, which include missing data, single-attempt responses, multiple-attempt responses, and unknown growth in ability across attempts, leading to a complex scenario for using this kind of…
Descriptors: Sequential Approach, Item Response Theory, Data, Simulation
Köse, Alper; Dogan, C. Deha – International Journal of Evaluation and Research in Education, 2019
The aim of this study was to examine the precision of item parameter estimation across different sample sizes and test lengths under the three-parameter logistic (3PL) item response theory (IRT) model, where the trait measured by a test followed a skewed, non-normal distribution. In the study, the number of categories (1-0) and item…
Descriptors: Statistical Bias, Item Response Theory, Simulation, Accuracy
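The 3PL model named in this abstract gives the probability of a correct response as P(θ) = c + (1 − c) / (1 + e^(−a(θ − b))). A minimal simulation sketch of generating dichotomous (1-0) responses under that model (the parameter values and sample size here are illustrative, not taken from the study):

```python
import numpy as np

def p_3pl(theta, a, b, c):
    """Three-parameter logistic item response function."""
    return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

rng = np.random.default_rng(0)
theta = rng.standard_normal(500)           # examinee abilities
a, b, c = 1.2, 0.0, 0.2                    # discrimination, difficulty, guessing
p = p_3pl(theta, a, b, c)
responses = (rng.random(500) < p).astype(int)  # simulated 1-0 responses
```

At θ = b the model reduces to c + (1 − c)/2, which is a quick sanity check on any implementation.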
Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014
When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…
Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics
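The CA and CC quantities discussed here have simple brute-force definitions under a simulated IRT model: CA is agreement between observed and true classifications, and CC is agreement between two parallel administrations. A parametric (2PL) simulation sketch — the kind of model-based estimate whose assumptions the article's nonparametric approach avoids; all values are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)

def prob_correct(theta, items):
    """2PL probability matrix: examinees (rows) by items (columns)."""
    a, b = items[:, 0], items[:, 1]
    return 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))

theta = rng.standard_normal(5000)                      # true abilities
items = np.column_stack([rng.uniform(0.8, 1.6, 30),    # discriminations
                         rng.normal(0.0, 1.0, 30)])    # difficulties
p = prob_correct(theta, items)
cut = 15                                               # total-score cut
x1 = (rng.random(p.shape) < p).sum(axis=1)             # observed scores
x2 = (rng.random(p.shape) < p).sum(axis=1)             # parallel form
true_class = p.sum(axis=1) >= cut                      # true-score status
ca = np.mean((x1 >= cut) == true_class)                # classification accuracy
cc = np.mean((x1 >= cut) == (x2 >= cut))               # classification consistency
```

Misclassification concentrates near the cut score, so both proportions stay well above chance for a 30-item test.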
Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi – Applied Psychological Measurement, 2013
Variable-length computerized adaptive testing (VL-CAT) allows both items and test length to be "tailored" to examinees, thereby achieving the measurement goal (e.g., scoring precision or classification) with as few items as possible. Several popular test termination rules depend on the standard error of the ability estimate, which in turn depends…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Length, Ability
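Under IRT, the standard error of the ability estimate is approximately 1/√I(θ̂), the inverse square root of the test information at the current estimate, which is what the termination rules mentioned above monitor. A sketch with 2PL items and an illustrative SE target:

```python
import numpy as np

def item_info_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def should_stop(theta_hat, administered, se_target=0.3):
    """SE-based termination: stop once SE(theta_hat) <= target."""
    info = sum(item_info_2pl(theta_hat, a, b) for a, b in administered)
    se = 1.0 / np.sqrt(info) if info > 0 else np.inf
    return se <= se_target
```

Because information accumulates item by item, examinees whose administered items are poorly matched to their ability need more items to reach the same SE target — the dependence the abstract points to.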
Liang, Tie; Wells, Craig S.; Hambleton, Ronald K. – Journal of Educational Measurement, 2014
As item response theory has been more widely applied, investigating the fit of a parametric model becomes an important part of the measurement process, yet promising solutions for detecting model misfit in IRT remain scarce. Douglas and Cohen introduced a general nonparametric approach, RISE (Root Integrated Squared Error), for detecting…
Descriptors: Item Response Theory, Measurement Techniques, Nonparametric Statistics, Models
Wang, Chun; Chang, Hua-Hua; Boughton, Keith A. – Applied Psychological Measurement, 2013
Multidimensional computerized adaptive testing (MCAT) is able to provide a vector of ability estimates for each examinee, which could be used to provide a more informative profile of an examinee's performance. The current literature on MCAT focuses on the fixed-length tests, which can generate less accurate results for those examinees whose…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Item Banks
Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014
This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…
Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias
Wang, Wei – ProQuest LLC, 2013
Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests are often considered superior to tests containing only MC items, although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Descriptors: Equated Scores, Test Format, Test Items, Test Length
He, Wei; Wolfe, Edward W. – Educational and Psychological Measurement, 2012
In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…
Descriptors: Individual Testing, Intelligence Tests, Test Items, Test Length
de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009
Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…
Descriptors: Ability, Tests, Item Response Theory, Data Analysis

Whitmore, Marjorie L.; Schumacker, Randall E. – Educational and Psychological Measurement, 1999
Compared differential item functioning detection rates for logistic regression and analysis of variance for dichotomously scored items using simulated data and varying test length, sample size, discrimination rate, and underlying ability. Explains why the logistic regression method is recommended for most applications. (SLD)
Descriptors: Ability, Analysis of Variance, Comparative Analysis, Item Bias
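The logistic regression DIF method recommended here regresses the item response on total score (the matching variable) and tests whether adding group membership improves fit. A NumPy-only sketch using a likelihood-ratio statistic, with a hand-rolled Newton-Raphson fit standing in for a statistics package (data and effect sizes are illustrative):

```python
import numpy as np

def fit_logistic(X, y, n_iter=25):
    """Fit logistic regression by Newton-Raphson; return log-likelihood."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        H = X.T @ (X * (p * (1.0 - p))[:, None]) + 1e-8 * np.eye(X.shape[1])
        beta += np.linalg.solve(H, X.T @ (y - p))
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    eps = 1e-12
    return np.sum(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

def lr_dif_stat(item, total, group):
    """2(LL1 - LL0): ~ chi-square(1) under the no-uniform-DIF null."""
    n = len(item)
    X0 = np.column_stack([np.ones(n), total])           # matching model
    X1 = np.column_stack([np.ones(n), total, group])    # + group term
    return 2.0 * (fit_logistic(X1, item) - fit_logistic(X0, item))

# Simulate an item that favors group 1 even after matching on total score
rng = np.random.default_rng(7)
n = 4000
group = rng.integers(0, 2, n)
total = rng.standard_normal(n)
p = 1.0 / (1.0 + np.exp(-(0.2 + total + 1.0 * group)))
item = (rng.random(n) < p).astype(int)
stat = lr_dif_stat(item, total, group)   # large => uniform DIF flagged
```

Adding an interaction term (total × group) to X1 extends the same test to nonuniform DIF.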
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics
Kim, Seock-Ho; And Others – 1992
Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…
Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)
Spray, Judith A.; Reckase, Mark D. – 1994
The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
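The two selection strategies compared in this entry can be sketched as maximizing Fisher information at either the fixed decision point or the current ability estimate (2PL item pool; all values are illustrative):

```python
import numpy as np

def item_info_2pl(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def pick_item(pool, theta_point):
    """Index of the pool item most informative at theta_point."""
    return int(np.argmax([item_info_2pl(theta_point, a, b) for a, b in pool]))

pool = [(1.0, -1.0), (1.0, 0.0), (1.0, 1.0)]  # (a, b) pairs
cut_score = 0.0        # decision point on the ability scale
theta_hat = 1.0        # current ability estimate
at_cut = pick_item(pool, cut_score)     # item with b nearest the cut
at_theta = pick_item(pool, theta_hat)   # item with b nearest theta_hat
```

For equal discriminations, 2PL information peaks where b = θ, so the two rules pick different items whenever the examinee's ability estimate sits away from the decision point.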