ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	28

Descriptor

Test Length	28
Test Items	19
Item Response Theory	15
Simulation	14
Comparative Analysis	12
Accuracy	10
Sample Size	10
Computation	8
Models	8
Adaptive Testing	7
Computer Assisted Testing	7
Classification	5
Difficulty Level	5
Sampling	5
Test Format	5
Correlation	4
Educational Testing	4
Error of Measurement	4
Goodness of Fit	4
Markov Processes	4
Monte Carlo Methods	4
Bayesian Statistics	3
Educational Assessment	3
Equated Scores	3
Investigations	3
More ▼

Source

ProQuest LLC

Publication Type

Dissertations/Theses -…

Education Level

Higher Education	2
Elementary Education	1
Grade 6	1
Intermediate Grades	1
Middle Schools	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Law School Admission Test	1
Nelson Denny Reading Tests	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

The Impact of Scoring Later on Mixed Format Adaptive Testing

Direct link

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Evaluation of the Goodness-of-Fit Index M[subscript ord] in Polytomous DCMS with Hierarchical Attribute Structures

Direct link

Haimiao Yuan – ProQuest LLC, 2022

The application of diagnostic classification models (DCMs) in the field of educational measurement is getting more attention in recent years. To make a valid inference from the model, it is important to ensure that the model fits the data. The purpose of the present study was to investigate the performance of the limited information…

Descriptors: Goodness of Fit, Educational Assessment, Educational Diagnosis, Models

Can Auxiliary Information Improve Rasch Estimation at Small Sample Sizes?

Direct link

Derek Sauder – ProQuest LLC, 2020

The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…

Descriptors: Item Response Theory, Sample Size, Computation, Test Length

Item-Reduction Methodologies for Complex Educational Assessments: A Comparative Methodological Exploration

Direct link

Lance M. Kruse – ProQuest LLC, 2019

This study explores six item-reduction methodologies used to shorten an existing complex problem-solving non-objective test by evaluating how each shortened form performs across three sources of validity evidence (i.e., test content, internal structure, and relationships with other variables). Two concerns prompted the development of the present…

Descriptors: Educational Assessment, Comparative Analysis, Test Format, Test Length

Examining the Impact of Differential Item Functioning on Growth Models

Direct link

Samonte, Kelli Marie – ProQuest LLC, 2017

Longitudinal data analysis assumes that scales meet the assumption of longitudinal measurement invariance (i.e., that scales function equivalently across measurement occasions). This simulation study examines the impact of violations to the assumption of longitudinal measurement invariance on growth models and whether modeling the invariance…

Descriptors: Test Bias, Growth Models, Longitudinal Studies, Simulation

A Novel Approach to the Nelson Denny Reading Test: Determining Extended Time Efficacy in Specific Learning Disorder with Impairment in Reading

Direct link

Klim, Joseph T. – ProQuest LLC, 2019

Specific Learning Disorder with impairment in reading (SLD-R) is the most widely diagnosed neurodevelopmental disorder. Individuals with SLD-R face many academic, social, and work challenges. To alleviate these difficulties, accommodations are provided, the most common being extended time for tests. The literature on extended time efficacy for…

Descriptors: Reading Comprehension, Reading Tests, Vocabulary, Learning Disabilities

Modelling Student Misconceptions Using Nested Logit Item Response Models

Direct link

Yildiz, Mustafa – ProQuest LLC, 2017

Student misconceptions have been studied for decades from a curricular/instructional perspective and from the assessment/test level perspective. Numerous misconception assessment tools have been developed in order to measure students' misconceptions relative to the correct content. Often, these tools are used to make a variety of educational…

Descriptors: Misconceptions, Students, Item Response Theory, Models

A Fair Comparison of the Performance of Computerized Adaptive Testing and Multistage Adaptive Testing

Direct link

Wang, Keyin – ProQuest LLC, 2017

The comparison of item-level computerized adaptive testing (CAT) and multistage adaptive testing (MST) has been researched extensively (e.g., Kim & Plake, 1993; Luecht et al., 1996; Patsula, 1999; Jodoin, 2003; Hambleton & Xing, 2006; Keng, 2008; Zheng, 2012). Various CAT and MST designs have been investigated and compared under the same…

Descriptors: Comparative Analysis, Computer Assisted Testing, Adaptive Testing, Test Items

Identifying Aberrant Responding: Use of Multiple Measures

Direct link

Steinkamp, Susan Christa – ProQuest LLC, 2017

For test scores that rely on the accurate estimation of ability via an IRT model, their use and interpretation is dependent upon the assumption that the IRT model fits the data. Examinees who do not put forth full effort in answering test questions, have prior knowledge of test content, or do not approach a test with the intent of answering…

Descriptors: Test Items, Item Response Theory, Scores, Test Wiseness

Comparing Three Estimation Methods for the Three-Parameter Logistic IRT Model

Direct link

Lamsal, Sunil – ProQuest LLC, 2015

Different estimation procedures have been developed for the unidimensional three-parameter item response theory (IRT) model. These techniques include the marginal maximum likelihood estimation, the fully Bayesian estimation using Markov chain Monte Carlo simulation techniques, and the Metropolis-Hastings Robbin-Monro estimation. With each…

Descriptors: Item Response Theory, Monte Carlo Methods, Maximum Likelihood Statistics, Markov Processes

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Examination of the Parameter Estimate Bias When Violating the Orthogonality Assumption of the Bifactor Model

Direct link

Zheng, Chunmei – ProQuest LLC, 2013

Educational and psychological constructs are normally measured by multifaceted dimensions. The measured construct is defined and measured by a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…

Descriptors: Educational Testing, Measurement Techniques, Test Items, Models

Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

Direct link

Lee, Eunjung – ProQuest LLC, 2013

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

Descriptors: Equated Scores, Tests, Comparative Analysis, Item Response Theory

Cognitive Diagnostic Analysis Using Hierarchically Structured Skills

Direct link

Su, Yu-Lan – ProQuest LLC, 2013

This dissertation proposes two modified cognitive diagnostic models (CDMs), the deterministic, inputs, noisy, "and" gate with hierarchy (DINA-H) model and the deterministic, inputs, noisy, "or" gate with hierarchy (DINO-H) model. Both models incorporate the hierarchical structures of the cognitive skills in the model estimation…

Descriptors: Models, Diagnostic Tests, Cognitive Processes, Thinking Skills

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

Previous Page | Next Page »

Pages: 1 | 2

Deng, Nina	1
Derek Sauder	1
Evans, Josiah Jeremiah	1
Foley, Brett Patrick	1
Fu, Qiong	1
Haimiao Yuan	1
Huo, Yan	1
Jing Ma	1
Kim, Jihye	1
Kim, Jiseon	1
Klim, Joseph T.	1
Lamsal, Sunil	1
Lance M. Kruse	1
Lee, Eunjung	1
Liu, Qian	1
Md Desa, Zairul Nor Deana	1
Qian, Hong	1
Samonte, Kelli Marie	1
Seo, Dong Gi	1
Steinkamp, Susan Christa	1
Su, Yu-Lan	1
Sunnassee, Devdass	1
Wang, Keyin	1
Wang, Wei	1
Wei, Youhua	1
More ▼