ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	22
Since 2006 (last 20 years)	60

Descriptor

Comparative Analysis	96
Test Length	96
Item Response Theory	42
Test Items	39
Sample Size	31
Computer Assisted Testing	27
Simulation	27
Adaptive Testing	20
Test Format	19
Error of Measurement	16
Statistical Analysis	16
Scores	15
Test Reliability	15
Item Analysis	14
Models	14
Correlation	13
Monte Carlo Methods	13
Accuracy	12
Difficulty Level	12
Higher Education	12
Test Validity	12
Computation	11
Mathematical Models	11
Maximum Likelihood Statistics	11
Classification	10
More ▼

Publication Type

Reports - Research	64
Journal Articles	56
Speeches/Meeting Papers	20
Reports - Evaluative	19
Dissertations/Theses -…	12
Numerical/Quantitative Data	2
Tests/Questionnaires	2
Information Analyses	1

Education Level

Higher Education	5
Elementary Secondary Education	3
Postsecondary Education	3
Elementary Education	2
Secondary Education	2
Grade 6	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Middle Schools	1

Audience

Researchers

Location

Turkey	4
Asia	1
Canada	1
China	1
Michigan	1
Netherlands	1
Singapore	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Wechsler Adult Intelligence…	3
Kaufman Brief Intelligence…	2
Minnesota Multiphasic…	2
ACTFL Oral Proficiency…	1
Advanced Placement…	1
Center for Epidemiologic…	1
Law School Admission Test	1
Marlowe Crowne Social…	1
NEO Five Factor Inventory	1
Program for International…	1
SAT (College Admission Test)	1
School and College Ability…	1
Sensation Seeking Scale	1
Trends in International…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 96 results Save | Export

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

The Effect of Ratio of Items Indicating Differential Item Functioning on Computer Adaptive and Multi-Stage Tests

Peer reviewed
PDF on ERIC

Download full text

Erdem-Kara, Basak; Dogan, Nuri – International Journal of Assessment Tools in Education, 2022

Recently, adaptive test approaches have become a viable alternative to traditional fixed-item tests. The main advantage of adaptive tests is that they reach desired measurement precision with fewer items. However, fewer items mean that each item has a more significant effect on ability estimation and therefore those tests are open to more…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Test Construction

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue Zhang; Chun Wang – Grantee Submission, 2022

Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…

Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length

The Comparison of Estimation Methods for the Four-Parameter Logistic Item Response Theory Model

Peer reviewed

Direct link

Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022

The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature due to the advances in the statistical modeling software and the recent developments in the estimation of the 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…

Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Direct link

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

Closed Formula of Test Length Required for Adaptive Testing with Medium Probability of Solution

Peer reviewed

Direct link

Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023

Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…

Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level

A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning

Peer reviewed

Direct link

Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022

Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…

Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Download full text

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Comparison of Confirmatory Factor Analysis Estimation Methods on Mixed-Format Data

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…

Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics

Item-Reduction Methodologies for Complex Educational Assessments: A Comparative Methodological Exploration

Direct link

Lance M. Kruse – ProQuest LLC, 2019

This study explores six item-reduction methodologies used to shorten an existing complex problem-solving non-objective test by evaluating how each shortened form performs across three sources of validity evidence (i.e., test content, internal structure, and relationships with other variables). Two concerns prompted the development of the present…

Descriptors: Educational Assessment, Comparative Analysis, Test Format, Test Length

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

The Impact of Q-Matrix Designs on Diagnostic Classification Accuracy in the Presence of Attribute Hierarchies

Peer reviewed

Direct link

Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017

There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…

Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests

Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

Peer reviewed

Direct link

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…

Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

ProQuest LLC	12
Educational and Psychological…	9
Applied Psychological…	8
ETS Research Report Series	5
Psychological Assessment	4
Applied Measurement in…	3
Educational Sciences: Theory…	2
International Journal of…	2
Journal of Educational…	2
Psychometrika	2
Asia Pacific Education Review	1
College Entrance Examination…	1
Education and Information…	1
Educational Research and…	1
European Journal of Special…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Language Testing	1
Learning Disabilities: A…	1
Measurement in Physical…	1
Measurement:…	1
OECD Publishing (NJ1)	1
More ▼

Hambleton, Ronald K.	3
Dogan, Nuri	2
Drasgow, Fritz	2
Eggen, Theo J. H. M.	2
Frick, Theodore W.	2
Gessaroli, Marc E.	2
Kelecioglu, Hülya	2
Kim, Seock-Ho	2
Lee, Yi-Hsuan	2
Paek, Insu	2
Reckase, Mark D.	2
Schumacker, Randall E.	2
Weiss, David J.	2
Zhang, Jinming	2
Allan S. Cohen	1
Allen, Nancy L.	1
Allspach, Jill R.	1
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Bazaldua, Diego A. Luna	1
Bejar, Isaac I.	1
Benton, Tom	1
Bergstrom, Betty A.	1
Bradshaw, Laine	1
More ▼