Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 5
Since 2016 (last 10 years): 7
Since 2006 (last 20 years): 19
Descriptor
Accuracy: 19
Computation: 19
Test Length: 19
Item Response Theory: 13
Test Items: 12
Sample Size: 8
Classification: 6
Bayesian Statistics: 5
Simulation: 5
Adaptive Testing: 4
Computer Assisted Testing: 4
Author
Cheng, Ying: 2
He, Wei: 2
Lathrop, Quinn N.: 2
Baris Pekmezci, Fulya: 1
Chenchen Ma: 1
Chun Wang: 1
Deng, Nina: 1
Dogan, Nuri: 1
Edwards, Ashley A.: 1
Fu, Qiong: 1
Gawliczek, Piotr: 1
Publication Type
Journal Articles: 13
Reports - Research: 13
Dissertations/Theses -…: 5
Reports - Evaluative: 1
Education Level
Early Childhood Education: 1
High Schools: 1
Preschool Education: 1
Secondary Education: 1
Assessments and Surveys
National Assessment of…: 1
Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021
The accuracy of certain internal consistency estimators has been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…
Descriptors: Reliability, Computation, Accuracy, Sample Size
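For reference alongside the abstract above, a minimal sketch of one of the estimators it names, Cronbach's alpha, computed from a respondents-by-items score matrix (this is the standard textbook formula, not the authors' simulation code):

```python
from statistics import pvariance

def cronbach_alpha(scores):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of totals).

    `scores` is a list of rows, one row of item scores per respondent.
    """
    k = len(scores[0])                                  # number of items
    item_vars = [pvariance(col) for col in zip(*scores)]
    total_var = pvariance([sum(row) for row in scores])  # variance of total scores
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

Perfectly correlated items yield alpha = 1; items whose covariances cancel drive alpha toward 0.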
Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…
Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy
Baris Pekmezci, Fulya; Sengul Avsar, Asiye – International Journal of Assessment Tools in Education, 2021
A great deal of research on item response theory (IRT) is conducted through simulation. Item and ability parameters are estimated with varying numbers of replications under different test conditions. However, it is not clear what the appropriate number of replications should be. The aim of the current study is to develop guidelines for the…
Descriptors: Item Response Theory, Computation, Accuracy, Monte Carlo Methods
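The question the abstract above raises (how many replications are enough?) comes down to the Monte Carlo standard error of the quantity being estimated, which shrinks with the square root of the replication count. A minimal illustrative sketch, not taken from the study itself:

```python
import random
import statistics

def mc_standard_error(estimator, n_reps, seed=0):
    """Monte Carlo standard error of an estimator across replications.

    `estimator` is any callable taking a random.Random source and
    returning one replication's estimate; the SE of the mean estimate
    falls as 1/sqrt(n_reps).
    """
    rng = random.Random(seed)
    estimates = [estimator(rng) for _ in range(n_reps)]
    return statistics.stdev(estimates) / (len(estimates) ** 0.5)
```

For a toy estimator with unit variance, 400 replications give a standard error near 1/20, which is the kind of precision/replication trade-off such guidelines formalize.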
Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019
This study examines how the length and characteristics of the routing module in different panel designs affect measurement precision. Within the scope of the study, six routing module lengths, nine routing module characteristics, and two panel designs are examined. At the end of the study, the effects of conditions on…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Gawliczek, Piotr; Krykun, Viktoriia; Tarasenko, Nataliya; Tyshchenko, Maksym; Shapran, Oleksandr – Advanced Education, 2021
The article deals with an innovative, cutting-edge solution within the language testing realm, namely computer adaptive language testing (CALT) in accordance with the NATO Standardization Agreement 6001 (NATO STANAG 6001) requirements, for further implementation in foreign language training of personnel of the Armed Forces of Ukraine (AF of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Language Tests, Second Language Instruction
Yildiz, Mustafa – ProQuest LLC, 2017
Student misconceptions have been studied for decades from a curricular/instructional perspective and from the assessment/test level perspective. Numerous misconception assessment tools have been developed in order to measure students' misconceptions relative to the correct content. Often, these tools are used to make a variety of educational…
Descriptors: Misconceptions, Students, Item Response Theory, Models
Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014
When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…
Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics
Lathrop, Quinn N.; Cheng, Ying – Applied Psychological Measurement, 2013
Within the framework of item response theory (IRT), there are two recent lines of work on the estimation of classification accuracy (CA) rate. One approach estimates CA when decisions are made based on total sum scores, the other based on latent trait estimates. The former is referred to as the Lee approach, and the latter, the Rudner approach,…
Descriptors: Item Response Theory, Accuracy, Classification, Computation
Wu, Yi-Fang – ProQuest LLC, 2015
Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…
Descriptors: Item Response Theory, Test Items, Accuracy, Computation
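The abstract above centers on the three-parameter logistic (3PL) IRT model. As context, a minimal sketch of the standard 3PL item response function (the general model, not the dissertation's estimation code): discrimination a, difficulty b, and pseudo-guessing c determine the probability of a correct response at ability theta.

```python
import math

def p_correct_3pl(theta, a, b, c):
    """3PL item response function: P(theta) = c + (1 - c) / (1 + exp(-a(theta - b)))."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))
```

At theta = b the probability is halfway between the guessing floor c and 1; far below b it approaches c rather than 0, which is what distinguishes the 3PL from the 2PL model.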
Svetina, Dubravka – Educational and Psychological Measurement, 2013
The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…
Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length
Wang, Chun – Educational and Psychological Measurement, 2013
Cognitive diagnostic computerized adaptive testing (CD-CAT) purports to combine the strengths of both CAT and cognitive diagnosis. Cognitive diagnosis models aim at classifying examinees into the correct mastery profile group so as to pinpoint the strengths and weaknesses of each examinee, whereas CAT algorithms choose items to determine those…
Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests
He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014
For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…
Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing
Md Desa, Zairul Nor Deana – ProQuest LLC, 2012
In recent years, there has been increasing interest in estimating and improving subscore reliability. In this study, multidimensional item response theory (MIRT) and the bi-factor model were combined to estimate subscores, to obtain subscore reliability, and to classify subscores. Both the compensatory and partially compensatory MIRT…
Descriptors: Item Response Theory, Computation, Reliability, Classification
He, Wei; Wolfe, Edward W. – Educational and Psychological Measurement, 2012
In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…
Descriptors: Individual Testing, Intelligence Tests, Test Items, Test Length