ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	28
Since 2006 (last 20 years)	89

Descriptor

Item Response Theory	102
Statistical Analysis	102
Simulation	99
Test Items	40
Models	30
Comparative Analysis	24
Computation	23
Sample Size	21
Scores	18
Test Bias	16
Error of Measurement	15
Computer Assisted Testing	13
Correlation	13
Achievement Tests	11
Evaluation Methods	11
Foreign Countries	11
Item Analysis	11
Classification	10
Equated Scores	10
Goodness of Fit	10
Psychometrics	10
Data Analysis	9
Factor Analysis	9
Monte Carlo Methods	9
Regression (Statistics)	9
More ▼

Publication Type

Journal Articles	84
Reports - Research	76
Reports - Evaluative	14
Dissertations/Theses -…	7
Speeches/Meeting Papers	7
Reports - Descriptive	3
Collected Works - Proceedings	2

Education Level

Secondary Education	7
Higher Education	6
Postsecondary Education	6
Elementary Secondary Education	5
Elementary Education	4
High Schools	3
Intermediate Grades	3
Junior High Schools	3
Middle Schools	3
Grade 4	2
Grade 12	1
Grade 6	1
Grade 8	1
More ▼

Audience

Location

Turkey	2
Afghanistan	1
Canada	1
Finland	1
Florida	1
France	1
Illinois (Chicago)	1
Singapore	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	4
Trends in International…	3
Indiana Statewide Testing for…	2
Florida Comprehensive…	1
National Assessment of…	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 102 results Save | Export

Implementing a Standardized Effect Size in the POLYSIBTEST Procedure

Peer reviewed

Direct link

Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023

A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…

Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis

Gaussian Variational Estimation for Multidimensional Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Direct link

Cho, April E.; Wang, Chun; Zhang, Xue; Xu, Gongjun – Grantee Submission, 2020

Multidimensional Item Response Theory (MIRT) is widely used in assessment and evaluation of educational and psychological tests. It models the individual response patterns by specifying functional relationship between individuals' multiple latent traits and their responses to test items. One major challenge in parameter estimation in MIRT is that…

Descriptors: Item Response Theory, Mathematics, Statistical Inference, Maximum Likelihood Statistics

Bias, Type I Error Rates, and Statistical Power of a Latent Mediation Model in the Presence of Violations of Invariance

Peer reviewed

Direct link

Olivera-Aguilar, Margarita; Rikoon, Samuel H.; Gonzalez, Oscar; Kisbu-Sakarya, Yasemin; MacKinnon, David P. – Educational and Psychological Measurement, 2018

When testing a statistical mediation model, it is assumed that factorial measurement invariance holds for the mediating construct across levels of the independent variable X. The consequences of failing to address the violations of measurement invariance in mediation models are largely unknown. The purpose of the present study was to…

Descriptors: Error of Measurement, Statistical Analysis, Factor Analysis, Simulation

Probing for Bias: Comparing Populations Using Item Response Curves

Peer reviewed
PDF on ERIC

Download full text

Paul J. Walter; Edward Nuhfer; Crisel Suarez – Numeracy, 2021

We introduce an approach for making a quantitative comparison of the item response curves (IRCs) of any two populations on a multiple-choice test instrument. In this study, we employ simulated and actual data. We apply our approach to a dataset of 12,187 participants on the 25-item Science Literacy Concept Inventory (SLCI), which includes ample…

Descriptors: Item Analysis, Multiple Choice Tests, Simulation, Data Analysis

The Empirical Selection of Anchor Items Using a Multistage Approach

Direct link

Craig, Brandon – ProQuest LLC, 2017

The purpose of this study was to determine if using a multistage approach for the empirical selection of anchor items would lead to more accurate DIF detection rates than the anchor selection methods proposed by Kopf, Zeileis, & Strobl (2015b). A simulation study was conducted in which the sample size, percentage of DIF, and balance of DIF…

Descriptors: Simulation, Sample Size, Item Response Theory, Item Analysis

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Direct link

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

Response Time Based Nonparametric Kullback-Leibler Divergence Measure for Detecting Aberrant Test-Taking Behavior

Peer reviewed

Direct link

Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018

Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…

Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Direct link

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

On Using Simulations to Inform Decision Making during Instrument Development

Peer reviewed

Direct link

Morgan, Grant B.; Moore, Courtney A.; Floyd, Harlee S. – Journal of Psychoeducational Assessment, 2018

Although content validity--how well each item of an instrument represents the construct being measured--is foundational in the development of an instrument, statistical validity is also important to the decisions that are made based on the instrument. The primary purpose of this study is to demonstrate how simulation studies can be used to assist…

Descriptors: Simulation, Decision Making, Test Construction, Validity

Examining Differential Item Functioning: IRT-Based Detection in the Framework of Confirmatory Factor Analysis

Peer reviewed

Direct link

Dimitrov, Dimiter M. – Measurement and Evaluation in Counseling and Development, 2017

This article offers an approach to examining differential item functioning (DIF) under its item response theory (IRT) treatment in the framework of confirmatory factor analysis (CFA). The approach is based on integrating IRT- and CFA-based testing of DIF and using bias-corrected bootstrap confidence intervals with a syntax code in Mplus.

Descriptors: Test Bias, Item Response Theory, Factor Analysis, Evaluation Methods

Accuracy of a Classical Test Theory-Based Procedure for Estimating the Reliability of a Multistage Test. Research Report. ETS RR-17-02

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Livingston, Samuel A. – ETS Research Report Series, 2017

The purpose of this simulation study was to assess the accuracy of a classical test theory (CTT)-based procedure for estimating the alternate-forms reliability of scores on a multistage test (MST) having 3 stages. We generated item difficulty and discrimination parameters for 10 parallel, nonoverlapping forms of the complete 3-stage test and…

Descriptors: Accuracy, Test Theory, Test Reliability, Adaptive Testing

The Consequences of Ignoring Item Parameter Drift in Longitudinal Item Response Models

Peer reviewed

Direct link

Lee, Wooyeol; Cho, Sun-Joo – Applied Measurement in Education, 2017

Utilizing a longitudinal item response model, this study investigated the effect of item parameter drift (IPD) on item parameters and person scores via a Monte Carlo study. Item parameter recovery was investigated for various IPD patterns in terms of bias and root mean-square error (RMSE), and percentage of time the 95% confidence interval covered…

Descriptors: Item Response Theory, Test Items, Bias, Computation

Effects of Various Simulation Conditions on Latent-Trait Estimates: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Kogar, Hakan – International Journal of Assessment Tools in Education, 2018

The aim of this simulation study, determine the relationship between true latent scores and estimated latent scores by including various control variables and different statistical models. The study also aimed to compare the statistical models and determine the effects of different distribution types, response formats and sample sizes on latent…

Descriptors: Simulation, Context Effect, Computation, Statistical Analysis

How Does Polytomous Item Bias Affect Total-Group Survey Score Comparisons?

Peer reviewed

Direct link

Hidalgo, Ma Dolores; Benítez, Isabel; Padilla, Jose-Luis; Gómez-Benito, Juana – Sociological Methods & Research, 2017

The growing use of scales in survey questionnaires warrants the need to address how does polytomous differential item functioning (DIF) affect observed scale score comparisons. The aim of this study is to investigate the impact of DIF on the type I error and effect size of the independent samples t-test on the observed total scale scores. A…

Descriptors: Test Items, Test Bias, Item Response Theory, Surveys

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational and Psychological…	17
Applied Psychological…	15
ETS Research Report Series	9
Journal of Educational…	7
ProQuest LLC	7
Journal of Educational and…	5
Psychometrika	4
Applied Measurement in…	3
International Journal of…	3
Large-scale Assessments in…	3
Educational Sciences: Theory…	2
Eurasian Journal of…	2
Grantee Submission	2
International Educational…	2
Structural Equation Modeling:…	2
American Journal of…	1
Educational Research and…	1
Educational Testing Service	1
English Language Teaching	1
International Journal of…	1
International Journal of…	1
Journal of Psychoeducational…	1
Measurement and Evaluation in…	1
Measurement:…	1
Numeracy	1
More ▼

Sinharay, Sandip	6
Choi, Seung W.	3
Lu, Ying	3
Chang, Hua-Hua	2
Cho, Sun-Joo	2
De Boeck, Paul	2
DeMars, Christine E.	2
Deng, Nina	2
Finch, W. Holmes	2
French, Brian F.	2
Kelecioglu, Hülya	2
Kim, Dong-In	2
Rupp, Andre A.	2
Suh, Youngsuk	2
Tay, Louis	2
Walker, Cindy M.	2
Wan, Ping	2
Xu, Xueli	2
Abayeva, Nella F.	1
Adams, Raymond J.	1
Algina, James	1
Ali, Usama S.	1
Ames, Allison	1
Andersson, Björn	1
More ▼