ERIC - Search Results

Publication Date

In 2025	0
Since 2024	5
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	23
Since 2006 (last 20 years)	33

Descriptor

Accuracy	33
Error of Measurement	33
Simulation	33
Sample Size	11
Item Response Theory	10
Test Items	10
Comparative Analysis	7
Computation	7
Evaluation Methods	6
Factor Analysis	6
Statistical Analysis	6
Statistical Bias	6
Adaptive Testing	5
Computer Assisted Testing	5
Correlation	5
Data Analysis	5
Maximum Likelihood Statistics	5
Sampling	5
Foreign Countries	4
Goodness of Fit	4
Item Analysis	4
Models	4
Regression (Statistics)	4
Scores	4
Classification	3

Publication Type

Reports - Research	28
Journal Articles	27
Dissertations/Theses -…	2
Reports - Evaluative	2
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education	1
Secondary Education	1

Location

Saudi Arabia	1
Turkey	1

Assessments and Surveys

Big Five Inventory	1
Cognitive Abilities Test	1
Program for International…	1
Trends in International…	1

Showing 1 to 15 of 33 results

Evaluating Close Fit in Ordinal Factor Analysis Models with Multiply Imputed Data

Peer reviewed

Dexin Shi; Bo Zhang; Ren Liu; Zhehan Jiang – Educational and Psychological Measurement, 2024

Multiple imputation (MI) is one of the recommended techniques for handling missing data in ordinal factor analysis models. However, methods for computing MI-based fit indices under ordinal factor analysis models have yet to be developed. In this short note, we introduced the methods of using the standardized root mean squared residual (SRMR) and…

Descriptors: Goodness of Fit, Factor Analysis, Simulation, Accuracy

Establishing Practical Equivalence of Factor Loadings in Multigroup Confirmatory Factor Analysis

Christopher E. Shank – ProQuest LLC, 2024

This dissertation compares the performance of equivalence test (EQT) and null hypothesis test (NHT) procedures for identifying invariant and noninvariant factor loadings under a range of experimental manipulations. EQT is the statistically appropriate approach when the research goal is to find evidence of group similarity rather than group…

Descriptors: Factor Analysis, Goodness of Fit, Intervals, Comparative Analysis

Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention

Peer reviewed

Yan Xia; Selim Havan – Educational and Psychological Measurement, 2024

Although parallel analysis has been found to be an accurate method for determining the number of factors in many conditions with complete data, its application under missing data is limited. The existing literature recommends that, after using an appropriate multiple imputation method, researchers either apply parallel analysis to every imputed…

Descriptors: Data Interpretation, Factor Analysis, Statistical Inference, Research Problems

Temporal Misalignment in Intensive Longitudinal Data: Consequences and Solutions Based on Dynamic Structural Equation Models

Peer reviewed

Xiaohui Luo; Yueqin Hu – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Intensive longitudinal data has been widely used to examine reciprocal or causal relations between variables. However, these variables may not be temporally aligned. This study examined the consequences and solutions of the problem of temporal misalignment in intensive longitudinal data based on dynamic structural equation models. First the impact…

Descriptors: Structural Equation Models, Longitudinal Studies, Data Analysis, Causal Models

Evaluation of Statistical Methods Used to Meta-Analyse Results from Interrupted Time Series Studies: A Simulation Study

Peer reviewed

Korevaar, Elizabeth; Turner, Simon L.; Forbes, Andrew B.; Karahalios, Amalia; Taljaard, Monica; McKenzie, Joanne E. – Research Synthesis Methods, 2023

Interrupted time series (ITS) are often meta-analysed to inform public health and policy decisions but examination of the statistical methods for ITS analysis and meta-analysis in this context is limited. We simulated meta-analyses of ITS studies with continuous outcome data, analysed the studies using segmented linear regression with two…

Descriptors: Meta Analysis, Maximum Likelihood Statistics, Factor Analysis, Public Health

Using the Standardized Root Mean Squared Residual (SRMR) to Assess Exact Fit in Structural Equation Models

Peer reviewed

Pavlov, Goran; Maydeu-Olivares, Alberto; Shi, Dexin – Educational and Psychological Measurement, 2021

We examine the accuracy of p values obtained using the asymptotic mean and variance (MV) correction to the distribution of the sample standardized root mean squared residual (SRMR) proposed by Maydeu-Olivares to assess the exact fit of SEM models. In a simulation study, we found that under normality, the MV-corrected SRMR statistic provides…

Descriptors: Structural Equation Models, Goodness of Fit, Simulation, Error of Measurement

Improved Estimation of Poisson Rate Distributions through a Multimode Survey Design

Peer reviewed

Hitczenko, Marcin – Sociological Methods & Research, 2022

Researchers interested in studying the frequency of events or behaviors among a population must rely on count data provided by sampled individuals. Often, this involves a decision between live event counting, such as a behavioral diary, and recalled aggregate counts. Diaries are generally more accurate, but their greater cost and respondent burden…

Descriptors: Surveys, Social Science Research, Recall (Psychology), Diaries

A Comparison of Type I Error and Power Rates in Procedures Used Determining Test Dimensionality

Peer reviewed

Guler, Gul; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022

The purpose of this study was to investigate the Type I Error findings and power rates of the methods used to determine dimensionality in unidimensional and bidimensional psychological constructs for various conditions (characteristic of the distribution, sample size, length of the test, and interdimensional correlation) and to examine the joint…

Descriptors: Comparative Analysis, Error of Measurement, Decision Making, Factor Analysis

Statistical Power When Adjusting for Multiple Hypothesis Tests: Methodology Expansions and Software Tools

Peer reviewed

Kristin Porter; Luke Miratrix; Kristen Hunter – Society for Research on Educational Effectiveness, 2021

Background: Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs)…

Descriptors: Statistical Analysis, Hypothesis Testing, Computer Software, Randomized Controlled Trials

Impact of Item Parameter Drift on Rasch Scale Stability in Small Samples over Multiple Administrations

Peer reviewed

Kopp, Jason P.; Jones, Andrew T. – Applied Measurement in Education, 2020

Traditional psychometric guidelines suggest that at least several hundred respondents are needed to obtain accurate parameter estimates under the Rasch model. However, recent research indicates that Rasch equating results in accurate parameter estimates with sample sizes as small as 25. Item parameter drift under the Rasch model has been…

Descriptors: Item Response Theory, Psychometrics, Sample Size, Sampling

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Evaluating a Computerized Adaptive Testing Version of a Cognitive Ability Test Using a Simulation Study

Peer reviewed

Tsaousis, Ioannis; Sideridis, Georgios D.; AlGhamdi, Hannan M. – Journal of Psychoeducational Assessment, 2021

This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using a simulation study protocol put forth by Han, K. T. (2018a). For the needs of the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Cognitive Ability

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Source

Educational and Psychological…	6
Journal of Educational…	3
ETS Research Report Series	2
International Journal of…	2
ProQuest LLC	2
Research Synthesis Methods	2
Society for Research on…	2
Applied Measurement in…	1
Educational Measurement:…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Journal of Research on…	1
Journal of Special Education	1
MDRC	1
Psicologica: International…	1
Research Matters	1
Sociological Methods &…	1
Structural Equation Modeling:…	1

Author

Bloom, Howard S.	2
Moses, Tim	2
Porter, Kristin E.	2
Reardon, Sean F.	2
Unlu, Fatih	2
Aksu Dunya, Beyza	1
AlGhamdi, Hannan M.	1
Ayan, Cansu	1
Ayres, Kevin M.	1
Bo Zhang	1
Bolsinova, Maria	1
Botella, Juan	1
Bramley, Tom	1
Castellano, Katherine E.	1
Chengyu Cui	1
Cho, Sun-Joo	1
Christopher E. Shank	1
Chun Wang	1
Cikrikci, Nukhet	1
Cikrikci, Rahime Nukhet	1
Cimpian, Joseph R.	1
Dexin Shi	1
Forbes, Andrew B.	1
Garnier-Villarreal, Mauricio	1
Gongjun Xu	1