ERIC - Search Results

Publication Date

In 2025	1
Since 2024	7
Since 2021 (last 5 years)	21
Since 2016 (last 10 years)	30
Since 2006 (last 20 years)	36

Descriptor

Comparative Analysis	48
Error of Measurement	48
Item Analysis	48
Item Response Theory	20
Test Items	17
Scores	12
Factor Analysis	11
Foreign Countries	11
Sample Size	10
Simulation	9
Accuracy	8
Models	8
Statistical Analysis	8
Test Reliability	8
Correlation	7
Gender Differences	7
Goodness of Fit	7
Measurement Techniques	7
Achievement Tests	6
Difficulty Level	6
Regression (Statistics)	6
Test Construction	6
Classification	5
Computer Assisted Testing	5
Mathematical Models	5
More ▼

Publication Type

Reports - Research	36
Journal Articles	32
Reports - Evaluative	5
Dissertations/Theses -…	4
Speeches/Meeting Papers	3
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Elementary Education	5
Higher Education	5
Secondary Education	5
Postsecondary Education	4
Grade 8	2
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Intermediate Grades	1
Primary Education	1
More ▼

Audience

Researchers

Location

Saudi Arabia	3
Canada	1
Canada (Toronto)	1
Chile	1
Sudan	1
Turkey	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Beck Depression Inventory	1
Force Concept Inventory	1
National Assessment of…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 48 results Save | Export

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Latent Class Analysis with Measurement Invariance Testing: Simulation Study to Compare Overall Likelihood Ratio vs Residual Fit Statistics Based Model Selection

Peer reviewed

Direct link

Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A standard assumption of latent class (LC) analysis is conditional independence, that is the items of the LC are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…

Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models

Evaluating the Performance of Estimators in SEM and IRT with Ordinal Variables

Direct link

Klauth, Bo – ProQuest LLC, 2023

In conducting confirmatory factor analysis with ordered response items, the literature suggests that when the number of responses is five and item skewness (IS) is approximately normal, researchers can employ maximum likelihood with robust standard errors (MLR). However, MLR can yield biased factor loadings (FL) and FL standard errors (FLSE) when…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Error of Measurement

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Comparison of Methods for Identifying Differential Step Functioning with Polytomous Item Response Data

Peer reviewed

Direct link

Finch, Holmes – Applied Measurement in Education, 2022

Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…

Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Direct link

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

An Exploration of Comparability Issues in Educational Research: Scale Linking, Equating, and Propensity Score Weighting

Direct link

Wu, Tong – ProQuest LLC, 2023

This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…

Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores

The Study of the Effect of Item Parameter Drift on Ability Estimation Obtained from Adaptive Testing under Different Conditions

Peer reviewed
PDF on ERIC

Download full text

Sahin Kursad, Merve; Cokluk Bokeoglu, Omay; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022

Item parameter drift (IPD) is the systematic differentiation of parameter values of items over time due to various reasons. If it occurs in computer adaptive tests (CAT), it causes errors in the estimation of item and ability parameters. Identification of the underlying conditions of this situation in CAT is important for estimating item and…

Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Error of Measurement

Investigation of the Measurement Invariance of the Social Media Addiction Scale

Peer reviewed
PDF on ERIC

Download full text

Mehtap Aktas; Nezaket Bilge Uzun; Bilge Bakir Aygar – International Journal of Contemporary Educational Research, 2023

This study aims to examine the measurement invariance of the Social Media Addiction Scale (SMAS) in terms of gender, time spent on social media accounts, and the number of social media accounts. Invariance analyses conducted within the scope of the research were carried out on 672 participants. Measurement invariance studies were examined…

Descriptors: Addictive Behavior, Scores, Comparative Analysis, Measures (Individuals)

Two IRT Characteristic Curve Linking Methods Weighted by Information

Peer reviewed

Direct link

Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022

Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…

Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods

Impact of DIF on General Factor Mean Comparisons for Bifactor, Ordinal Data

Peer reviewed

Direct link

Liu, Yixing; Thompson, Marilyn S. – Journal of Experimental Education, 2022

A simulation study was conducted to explore the impact of differential item functioning (DIF) on general factor difference estimation for bifactor, ordinal data. Common analysis misspecifications in which the generated bifactor data with DIF were fitted using models with equality constraints on noninvariant item parameters were compared under data…

Descriptors: Comparative Analysis, Item Analysis, Sample Size, Error of Measurement

A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning

Peer reviewed

Direct link

Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022

Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…

Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations

Is the Force Concept Inventory Biased across the Intersections of Gender and Race?

Peer reviewed

Direct link

John B. Buncher; Jayson M. Nissen; Ben Van Dusen; Robert M. Talbot – Physical Review Physics Education Research, 2025

Research-based assessments (RBAs) allow researchers and practitioners to compare student performance across different contexts and institutions. In recent years, research attention has focused on the student populations these RBAs were initially developed with because much of that research was done with "samples of convenience" that were…

Descriptors: Science Tests, Physics, Comparative Analysis, Gender Differences

A Comparison of Procedures for Estimating Person Reliability Parameters in the Graded Response Model

Peer reviewed

Direct link

LaHuis, David M.; Bryant-Lees, Kinsey B.; Hakoyama, Shotaro; Barnes, Tyler; Wiemann, Andrea – Journal of Educational Measurement, 2018

Person reliability parameters (PRPs) model temporary changes in individuals' attribute level perceptions when responding to self-report items (higher levels of PRPs represent less fluctuation). PRPs could be useful in measuring careless responding and traitedness. However, it is unclear how well current procedures for estimating PRPs can recover…

Descriptors: Comparative Analysis, Reliability, Error of Measurement, Measurement Techniques

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	6
Applied Measurement in…	4
ProQuest LLC	4
Journal of Educational…	3
Journal of Experimental…	3
International Journal of…	2
International Journal of…	2
Developmental Psychology	1
ETS Research Report Series	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
Journal of Advanced Academics	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Language, Speech, and Hearing…	1
Online Submission	1
Physical Review Physics…	1
Psychometrika	1
Research Papers in Education	1
Research Quarterly for…	1
Structural Equation Modeling:…	1
More ▼

Haladyna, Tom	3
Bashaw, W. L.	2
Lee, Won-Chan	2
Rentz, R. Robert	2
Roid, Gale	2
ALKursheh, Taha Okleh	1
Abulela, Mohammed A. A.	1
Al-zboon, Habis Saad	1
AlNasraween, Mo'en Salman	1
Alamri, Abeer A.	1
Alqurashi, Fahad	1
Anwyll, Steve	1
Ayan, Cansu	1
Bakhiet, Salaheldin Farah	1
Balhmar, Tahani Abdulrahman	1
Barnes, Tyler	1
Bejar, Isaac I.	1
Ben Van Dusen	1
Benson, Jeri	1
Bilge Bakir Aygar	1
Bowes, Neal	1
Brink, Nicholas E.	1
Brinton, Bonnie	1
Browne, Dillon T.	1
More ▼