ERIC - Search Results

Showing 1 to 15 of 24 results

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Evaluating the Performance of Estimators in SEM and IRT with Ordinal Variables

Klauth, Bo – ProQuest LLC, 2023

In conducting confirmatory factor analysis with ordered response items, the literature suggests that when the number of responses is five and item skewness (IS) is approximately normal, researchers can employ maximum likelihood with robust standard errors (MLR). However, MLR can yield biased factor loadings (FL) and FL standard errors (FLSE) when…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Error of Measurement

Comparing MIMIC and MIMIC-Interaction to Alignment Methods for Investigating Measurement Invariance Concerning a Continuous Violator

Peer reviewed

Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…

Descriptors: Classification, Accuracy, Error of Measurement, Correlation

Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items

Peer reviewed

Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025

The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…

Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis

Evaluation of Structure Complexity Magnitude, Degree of Cross-Loading on Secondary Dimension and Model Specification on MIRT Parameter Estimation

Hosseinzadeh, Mostafa – ProQuest LLC, 2021

In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…

Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

A Modified "a"-Stratified Method for Computerized Adaptive Testing. Research Report. ETS RR-19-10

Peer reviewed
PDF on ERIC

Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019

Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high "a" items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items

On Studying Common Factor Dominance and Approximate Unidimensionality in Multicomponent Measuring Instruments with Discrete Items

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2018

This article outlines a procedure for examining the degree to which a common factor may be dominating additional factors in a multicomponent measuring instrument consisting of binary items. The procedure rests on an application of the latent variable modeling methodology and accounts for the discrete nature of the manifest indicators. The method…

Descriptors: Measurement Techniques, Factor Analysis, Item Response Theory, Likert Scales

Evaluation of Two Types of Differential Item Functioning in Factor Mixture Models with Binary Outcomes

Peer reviewed

Lee, HwaYoung; Beretvas, S. Natasha – Educational and Psychological Measurement, 2014

Conventional differential item functioning (DIF) detection methods (e.g., the Mantel-Haenszel test) can be used to detect DIF only across observed groups, such as gender or ethnicity. However, research has found that DIF is not typically fully explained by an observed variable. True sources of DIF may include unobserved, latent variables, such as…

Descriptors: Item Analysis, Factor Structure, Bayesian Statistics, Goodness of Fit

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Household Possessions Indices as Wealth Measures: A Validity Evaluation

Peer reviewed

Traynor, Anne; Raykov, Tenko – Comparative Education Review, 2013

In international achievement studies, questionnaires typically ask about the presence of particular household assets in students' homes. Responses to the assets questions are used to compute a total score, which is intended to represent household wealth in models of test performance. This study uses item analysis and confirmatory factor analysis…

Descriptors: Secondary School Students, Academic Achievement, Validity, Psychometrics

Measuring Educational Quality by Appraising Theses and Dissertations: Pitfalls and Remedies

Peer reviewed

Hamilton, Patti; Johnson, Robert; Poudrier, Chelsey – Teaching in Higher Education, 2010

In this paper, we argue that, as indicators of the educational quality of graduate degree programs, student theses and dissertations are best used in specific contexts. High-quality theses and dissertations, that is, may be the result of factors such as verbal skills students already possessed at admission or of complex interactions between…

Descriptors: Educational Quality, Doctoral Dissertations, Theses, Change Strategies

Mokken Scale Analysis for Dichotomous Items Using Marginal Models

Peer reviewed

van der Ark, L. Andries; Croon, Marcel A.; Sijtsma, Klaas – Psychometrika, 2008

Scalability coefficients play an important role in Mokken scale analysis. For a set of items, scalability coefficients have been defined for each pair of items, for each individual item, and for the entire scale. Hypothesis testing with respect to these scalability coefficients has not been fully developed. This study introduces marginal modelling…

Descriptors: Hypothesis Testing, Item Response Theory, Error of Measurement, Scaling
