Publication Date
In 2025: 0
Since 2024: 2
Since 2021 (last 5 years): 6
Since 2016 (last 10 years): 10
Since 2006 (last 20 years): 22
Descriptor
Evaluation Criteria: 35
Higher Education: 21
Admission Criteria: 18
Test Validity: 18
Criteria: 15
Predictive Validity: 13
Predictor Variables: 12
Factor Analysis: 11
Grade Point Average: 11
Correlation: 10
Models: 10
…
Source
Educational and Psychological Measurement: 75
Publication Type
Journal Articles: 51
Reports - Research: 38
Reports - Evaluative: 11
Reports - Descriptive: 3
Speeches/Meeting Papers: 2
Guides - Non-Classroom: 1
Tenko Raykov; Christine DiStefano; Lisa Calvocoressi – Educational and Psychological Measurement, 2024
This note demonstrates that the widely used Bayesian Information Criterion (BIC) need not be viewed as a routinely dependable index for model selection when the bifactor and second-order factor models are examined as rival means of data description and explanation. To this end, we use an empirically relevant setting with…
Descriptors: Bayesian Statistics, Models, Decision Making, Comparative Analysis
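As a quick illustration of the comparison the abstract describes, here is a minimal BIC sketch (BIC = -2 log L + k ln n; lower is preferred); the log-likelihoods, parameter counts, and sample size below are invented for illustration and are not taken from the article.

```python
# Minimal sketch: comparing two fitted rival models by BIC.
# BIC = -2 * logL + k * ln(n); the model with the lower BIC is preferred.
import math

def bic(log_likelihood: float, n_params: int, n_obs: int) -> float:
    """Bayesian Information Criterion for a fitted model."""
    return -2.0 * log_likelihood + n_params * math.log(n_obs)

# Illustrative values only (not from the article):
bic_bifactor = bic(log_likelihood=-4821.3, n_params=45, n_obs=500)
bic_second_order = bic(log_likelihood=-4830.9, n_params=39, n_obs=500)
preferred = "bifactor" if bic_bifactor < bic_second_order else "second-order"
print(f"BIC bifactor={bic_bifactor:.1f}, second-order={bic_second_order:.1f} -> {preferred}")
```

The note's point is precisely that this mechanical "pick the lower BIC" rule can mislead when the rivals are bifactor and second-order structures.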
Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024
Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed in existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald X² test-based…
Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy
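For orientation, a generic Lord-style Wald statistic compares one item's parameter estimates across a reference and a focal group, pooling their estimation covariance matrices. The sketch below uses invented 2PL estimates and covariances; the article's multilevel extension is not reproduced here.

```python
# Sketch of a Wald chi-square statistic of the kind used in Lord's DIF test:
# X^2 = d' (Sigma_R + Sigma_F)^{-1} d, with d the difference in item
# parameter estimates and df = len(d). All inputs are made up.
import numpy as np
from scipy import stats

def lord_wald(params_ref, params_focal, cov_ref, cov_focal):
    d = np.asarray(params_ref) - np.asarray(params_focal)
    pooled = np.asarray(cov_ref) + np.asarray(cov_focal)
    x2 = float(d @ np.linalg.inv(pooled) @ d)
    p = stats.chi2.sf(x2, df=len(d))
    return x2, p

# One 2PL item: (discrimination a, difficulty b) per group, toy covariances.
x2, p = lord_wald([1.2, 0.3], [1.0, 0.6],
                  np.diag([0.02, 0.03]), np.diag([0.02, 0.03]))
print(f"Wald X^2 = {x2:.2f}, p = {p:.3f}")
```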
Jang, Yoona; Hong, Sehee – Educational and Psychological Measurement, 2023
The purpose of this study was to evaluate the degree of classification quality in the basic latent class model when covariates are either included in or excluded from the model. To accomplish this task, Monte Carlo simulations were conducted in which the results of models with and without a covariate were compared. Based on these simulations,…
Descriptors: Classification, Models, Prediction, Sample Size
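Classification quality in latent class models is commonly summarized with an entropy-based index; as a hedged sketch (the article may use a different measure), the relative-entropy statistic can be computed from the posterior class probabilities, as below with an invented posterior matrix.

```python
# Relative-entropy classification quality: 1 - sum(-p*ln p) / (N * ln K).
# Values near 1 indicate sharp class assignment; near 0, fuzzy assignment.
import numpy as np

def relative_entropy(post: np.ndarray) -> float:
    """post: N x K matrix of posterior class probabilities (rows sum to 1)."""
    n, k = post.shape
    p = np.clip(post, 1e-12, 1.0)  # guard against log(0)
    return 1.0 - float(np.sum(-p * np.log(p))) / (n * np.log(k))

post = np.array([[0.95, 0.05], [0.10, 0.90], [0.80, 0.20]])  # toy posteriors
print(f"entropy-based classification quality: {relative_entropy(post):.3f}")
```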
Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023
We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [ω-RT], omega hierarchical [ω-h], greatest lower bound [GLB], and coefficient H) with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…
Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions
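Coefficient alpha itself is straightforward to compute from a respondents-by-items matrix: alpha = k/(k-1) * (1 - sum of item variances / variance of total score). A minimal sketch with simulated data follows; the competitors named above (omega variants, GLB, coefficient H) require dedicated estimation routines and are not shown.

```python
# Cronbach's alpha from an N x k items matrix, using invented data.
import numpy as np

def cronbach_alpha(items: np.ndarray) -> float:
    """items: N respondents x k items."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1).sum()   # sum of item variances
    total_var = items.sum(axis=1).var(ddof=1)     # variance of total score
    return (k / (k - 1)) * (1.0 - item_vars / total_var)

rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 1))
items = latent + rng.normal(scale=0.8, size=(200, 5))  # 5 parallel items
print(f"alpha = {cronbach_alpha(items):.3f}")
```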
Liu, Xiaowen; Rogers, H. Jane – Educational and Psychological Measurement, 2022
Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…
Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity
Ziying Li; A. Corinne Huggins-Manley; Walter L. Leite; M. David Miller; Eric A. Wright – Educational and Psychological Measurement, 2022
The unstructured multiple-attempt (MA) item response data in virtual learning environments (VLEs) are often from student-selected assessment data sets, which include missing data, single-attempt responses, multiple-attempt responses, and unknown growth ability across attempts, leading to a complex scenario for using this kind of…
Descriptors: Sequential Approach, Item Response Theory, Data, Simulation
Lin, Chuan-Ju; Chang, Hua-Hua – Educational and Psychological Measurement, 2019
For item selection in cognitive diagnostic computerized adaptive testing (CD-CAT), ideally, a single item selection index should be created to simultaneously regulate precision, exposure status, and attribute balancing. For this purpose, in this study, we first proposed an attribute-balanced item selection criterion, namely, the standardized…
Descriptors: Test Items, Selection Criteria, Computer Assisted Testing, Adaptive Testing
Li, Ming; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017
Researchers continue to be interested in efficient, accurate methods of estimating coefficients of covariates in mixture modeling. Including covariates related to the latent class analysis may not only improve the ability of the mixture model to clearly differentiate between subjects but also make interpretation of latent group membership more…
Descriptors: Simulation, Comparative Analysis, Monte Carlo Methods, Guidelines
McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017
To date, small sample problems with latent growth models (LGMs) have not received the same amount of attention in the literature as those with related mixed-effects models (MEMs). Although many models can be interchangeably framed as an LGM or an MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…
Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling
Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016
The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze data with a nested structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…
Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation
Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Educational and Psychological Measurement, 2013
Whittaker, Chang, and Dodd compared the performance of model selection criteria when selecting among mixed-format IRT models and found that the criteria did not perform adequately when selecting the more parameterized models. M. S. Johnson suggested that the problems when selecting the more parameterized models may be due to the low…
Descriptors: Item Response Theory, Models, Selection Criteria, Accuracy
Kam, Chester Chun Seng; Zhou, Mingming – Educational and Psychological Measurement, 2015
Previous research has found the effects of acquiescence to be generally consistent across item "aggregates" within a single survey (i.e., essential tau-equivalence), but it is unknown whether this phenomenon is consistent at the "individual item" level. This article evaluated the often assumed but inadequately tested…
Descriptors: Test Items, Surveys, Criteria, Correlation
Lin, Chuan-Ju – Educational and Psychological Measurement, 2011
This study compares four item selection criteria for two-category computerized classification testing: (1) Fisher information (FI), (2) Kullback-Leibler information (KLI), (3) weighted log-odds ratio (WLOR), and (4) mutual information (MI), with respect to the efficiency and accuracy of classification decisions using the sequential probability…
Descriptors: Computer Assisted Testing, Adaptive Testing, Selection, Test Items
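The first of those criteria is easy to show concretely: under a 2PL model, Fisher information at a provisional ability estimate is I(θ) = a² P(θ)(1 − P(θ)), and the unadministered item maximizing it is selected next. The sketch below uses a fabricated four-item pool; the other three criteria (KLI, WLOR, MI) are not shown.

```python
# Maximum Fisher information item selection for 2PL items at a provisional
# theta: I(theta) = a^2 * P * (1 - P). Item pool is fabricated.
import numpy as np

def p_2pl(theta: float, a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """2PL response probability for each item at ability theta."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def select_max_info(theta: float, a: np.ndarray, b: np.ndarray,
                    administered: set) -> int:
    p = p_2pl(theta, a, b)
    info = a ** 2 * p * (1.0 - p)
    info[list(administered)] = -np.inf  # never reuse an administered item
    return int(np.argmax(info))

a = np.array([0.8, 1.2, 1.5, 1.0])   # discriminations
b = np.array([-0.5, 0.0, 0.4, 1.1])  # difficulties
print("next item:", select_max_info(theta=0.2, a=a, b=b, administered={1}))
```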
Schmitt, Thomas A.; Sass, Daniel A. – Educational and Psychological Measurement, 2011
Exploratory factor analysis (EFA) has long been used in the social sciences to depict the relationships between variables/items and latent traits. Researchers face many choices when using EFA, including the choice of rotation criterion, which can be difficult given that few research articles have discussed and/or demonstrated their differences.…
Descriptors: Hypothesis Testing, Factor Analysis, Correlation, Criteria
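To make the notion of a rotation criterion concrete, here is a compact implementation of varimax, one common orthogonal criterion, applied to an invented unrotated loading matrix; oblique criteria such as oblimin work analogously but allow correlated factors.

```python
# Varimax rotation of a p x k loading matrix via the standard SVD iteration.
# The rotated solution explains the same variance; only the axes change.
import numpy as np

def varimax(loadings: np.ndarray, tol: float = 1e-6, max_iter: int = 100):
    p, k = loadings.shape
    rot = np.eye(k)
    var_old = 0.0
    for _ in range(max_iter):
        lam = loadings @ rot
        u, s, vt = np.linalg.svd(
            loadings.T @ (lam ** 3 - lam * (lam ** 2).mean(axis=0)))
        rot = u @ vt
        var_new = s.sum()
        if var_new - var_old < tol:  # criterion no longer improving
            break
        var_old = var_new
    return loadings @ rot

L = np.array([[0.7, 0.3], [0.6, 0.4], [0.2, 0.8], [0.1, 0.7]])  # toy loadings
print(np.round(varimax(L), 2))
```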
Plieninger, Hansjörg; Meiser, Thorsten – Educational and Psychological Measurement, 2014
Response styles, the tendency to respond to Likert-type items irrespective of content, are a widely known threat to the reliability and validity of self-report measures. However, it is still debated how to measure and control for response styles such as extreme responding. Recently, multiprocess item response theory models have been proposed that…
Descriptors: Validity, Item Response Theory, Rating Scales, Models