ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	27

Descriptor

Computation	28
Test Items	28
Test Length	28
Item Response Theory	17
Accuracy	12
Sample Size	12
Error of Measurement	9
Simulation	7
Comparative Analysis	6
Correlation	6
Maximum Likelihood Statistics	6
Statistical Analysis	6
Test Bias	6
Adaptive Testing	5
Classification	5
Computer Assisted Testing	5
Difficulty Level	5
Monte Carlo Methods	5
Scores	5
Ability	4
Bayesian Statistics	4
Models	4
Foreign Countries	3
Goodness of Fit	3
Guessing (Tests)	3
More ▼

Source

Educational and Psychological…	10
ProQuest LLC	6
Applied Psychological…	3
Advanced Education	1
Anatomical Sciences Education	1
Applied Measurement in…	1
ETS Research Report Series	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Universal Journal of…	1
More ▼

Publication Type

Journal Articles	22
Reports - Research	19
Dissertations/Theses -…	6
Reports - Evaluative	3

Education Level

Early Childhood Education	1
Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1
Preschool Education	1

Audience

Location

Alabama	1
Illinois (Chicago)	1
Indiana	1
Iran	1
Turkey	1
Ukraine	1

Laws, Policies, & Programs

Assessments and Surveys

National Longitudinal Study…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

A Simulation Study on the Performance of Different Reliability Estimation Methods

Peer reviewed

Direct link

Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021

The accuracy of certain internal consistency estimators have been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…

Descriptors: Reliability, Computation, Accuracy, Sample Size

Evaluation of Factors Affecting the Performance of the "S - X[superscript 2]" Item-Fit Index

Peer reviewed

Direct link

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

Can Auxiliary Information Improve Rasch Estimation at Small Sample Sizes?

Direct link

Derek Sauder – ProQuest LLC, 2020

The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…

Descriptors: Item Response Theory, Sample Size, Computation, Test Length

How the Length and Characteristics of Routing Module Affect Ability Estimation in ca-MST?

Peer reviewed
PDF on ERIC

Download full text

Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019

In this study, how the length and characteristics of routing module in different panel designs affect measurement precision is examined. In the scope of the study, six different routing module length, nine different routing module characteristics, and two different panel design are handled. At the end of the study, the effects of conditions on…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format

The Performance of the Semigeneralized Partial Credit Model for Handling Item-Level Missingness

Peer reviewed

Direct link

Zhou, Sherry; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2020

The semi-generalized partial credit model (Semi-GPCM) has been proposed as a unidimensional modeling method for handling not applicable scale responses and neutral scale responses, and it has been suggested that the model may be of use in handling missing data in scale items. The purpose of this study is to evaluate the ability of the…

Descriptors: Models, Statistical Analysis, Response Style (Tests), Test Items

Detection and Treatment of Careless Responses to Improve Item Parameter Estimation

Peer reviewed

Direct link

Patton, Jeffrey M.; Cheng, Ying; Hong, Maxwell; Diao, Qi – Journal of Educational and Behavioral Statistics, 2019

In psychological and survey research, the prevalence and serious consequences of careless responses from unmotivated participants are well known. In this study, we propose to iteratively detect careless responders and cleanse the data by removing their responses. The careless responders are detected using person-fit statistics. In two simulation…

Descriptors: Test Items, Response Style (Tests), Identification, Computation

Comparison of Confirmatory Factor Analysis Estimation Methods on Mixed-Format Data

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…

Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics

Computer Adaptive Language Testing According to NATO STANAG 6001 Requirements

Peer reviewed
PDF on ERIC

Download full text

Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021

The article deals with the innovative, cutting age solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…

Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction

Using Generalizability Analysis to Estimate Parameters for Anatomy Assessments: A Multi-institutional Study

Peer reviewed

Direct link

Byram, Jessica N.; Seifert, Mark F.; Brooks, William S.; Fraser-Cotlin, Laura; Thorp, Laura E.; Williams, James M.; Wilson, Adam B. – Anatomical Sciences Education, 2017

With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in…

Descriptors: Anatomy, Science Tests, Test Items, Test Reliability

Item Parameter Drift in a Time-Varying Predictor

Peer reviewed

Direct link

Lee, HyeSun – Applied Measurement in Education, 2018

The current simulation study examined the effects of Item Parameter Drift (IPD) occurring in a short scale on parameter estimates in multilevel models where scores from a scale were employed as a time-varying predictor to account for outcome scores. Five factors, including three decisions about IPD, were considered for simulation conditions. It…

Descriptors: Test Items, Hierarchical Linear Modeling, Predictor Variables, Scores

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Assessing Dimensionality of Noncompensatory Multidimensional Item Response Theory with Complex Structures

Peer reviewed

Direct link

Svetina, Dubravka – Educational and Psychological Measurement, 2013

The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…

Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length

Mutual Information Item Selection Method in Cognitive Diagnostic Computerized Adaptive Testing with Short Test Length

Peer reviewed

Direct link

Wang, Chun – Educational and Psychological Measurement, 2013

Cognitive diagnostic computerized adaptive testing (CD-CAT) purports to combine the strengths of both CAT and cognitive diagnosis. Cognitive diagnosis models aim at classifying examinees into the correct mastery profile group so as to pinpoint the strengths and weakness of each examinee whereas CAT algorithms choose items to determine those…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests

Item Pool Design for an Operational Variable-Length Computerized Adaptive Test

Peer reviewed

Direct link

He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014

For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…

Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing

Previous Page | Next Page »

Pages: 1 | 2

He, Wei	2
Lee, Yi-Hsuan	2
Wang, Wen-Chung	2
Zhang, Jinming	2
Banks, Kathleen	1
Brooks, William S.	1
Byram, Jessica N.	1
Cappaert, Kevin	1
Chen, Cheng-Te	1
Cheng, Ying	1
Chernyshenko, Oleksandr S.	1
Cui, Zhongmin	1
Deng, Nina	1
Derek Sauder	1
Diao, Qi	1
Dogan, Nuri	1
Edwards, Ashley A.	1
Finch, Holmes	1
Fraser-Cotlin, Laura	1
Fu, Qiong	1
Gawliczek, Piotr	1
Hong, Maxwell	1
Huggins-Manley, Anne Corinne	1
Joyner, Keanan J.	1
Kilic, Abdullah Faruk	1
More ▼