ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	29

Descriptor

Correlation	30
Data Analysis	30
Item Response Theory	30
Models	12
Evaluation Methods	10
Scores	8
Foreign Countries	7
Sample Size	7
Simulation	7
Comparative Analysis	6
Monte Carlo Methods	6
Accuracy	5
Bayesian Statistics	5
Computation	5
Factor Analysis	5
Markov Processes	5
Statistical Analysis	5
Test Items	5
Tests	5
Classification	4
Measures (Individuals)	4
Computer Software	3
Difficulty Level	3
Educational Assessment	3
Error Patterns	3

Publication Type

Journal Articles	23
Reports - Research	20
Reports - Evaluative	6
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Secondary Education	5
Higher Education	4
Elementary Secondary Education	3
Postsecondary Education	3
Elementary Education	2
Junior High Schools	2
Middle Schools	2
Adult Education	1
Grade 6	1
Grade 8	1
High Schools	1
Intermediate Grades	1

Audience

Researchers

Location

Australia	1
Finland	1
France	1
Iran	1
Japan	1
Malaysia	1
South Korea	1
United Kingdom	1

Assessments and Surveys

National Assessment of…	2
Program for International…	2
Iowa Tests of Educational…	1
Minnesota Multiphasic…	1
Trends in International…	1

Showing 1 to 15 of 30 results

Predictive Fit Metrics for Item Response Models

Peer reviewed

Ben Stenhaug; Ben Domingue – Grantee Submission, 2022

The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…

Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models

Investigation of the Effect of Parameter Estimation and Classification Accuracy in Mixture IRT Models under Different Conditions

Peer reviewed

Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022

This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…

Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages

Using the Bayes Factors to Evaluate Person Fit in the Item Response Theory

Peer reviewed

Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017

In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…

Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods

Data Visualization of Item-Total Correlation by Median Smoothing

Peer reviewed

Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min – Practical Assessment, Research & Evaluation, 2016

This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…

Descriptors: Visualization, Item Response Theory, Sample Size, Correlation

Exploring Online Learning Data Using Fractal Dimensions. Research Report. ETS RR-17-15

Peer reviewed

Guo, Hongwen – ETS Research Report Series, 2017

Data collected from online learning and tutoring systems for individual students showed strong autocorrelation or dependence because of content connection, knowledge-based dependency, or persistence of learning behavior. When the response data show little dependence or negative autocorrelations for individual students, it is suspected that…

Descriptors: Data Collection, Electronic Learning, Intelligent Tutoring Systems, Information Utilization

Lord's Wald Test for Detecting DIF in Multidimensional IRT Models: A Comparison of Two Estimation Approaches

Peer reviewed

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

Test Assembly Implications for Providing Reliable and Valid Subscores

Peer reviewed

Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017

This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…

Descriptors: Scores, Test Construction, Test Reliability, Test Validity

Applications of Multidimensional Item Response Theory Models with Covariates to Longitudinal Test Data. Research Report. ETS RR-16-21

Peer reviewed

Fu, Jianbin – ETS Research Report Series, 2016

The multidimensional item response theory (MIRT) models with covariates proposed by Haberman and implemented in the "mirt" program provide a flexible way to analyze data based on item response theory. In this report, we discuss applications of the MIRT models with covariates to longitudinal test data to measure skill differences at the…

Descriptors: Item Response Theory, Longitudinal Studies, Test Bias, Goodness of Fit

The Effects of Test Characteristics on the Hierarchical Order of Reading Skills

Peer reviewed

Badrasawi, Kamal J. I.; Abu Kassim, Noor Lide; Daud, Nuraihan Mat – Malaysian Journal of Learning and Instruction, 2017

Purpose: The study sought to determine the hierarchical nature of reading skills. Whether reading is a "unitary" or "multi-divisible" skill is still a contentious issue. So is the hierarchical order of reading skills. Determining the hierarchy of reading skills is challenging as item difficulty is greatly influenced by factors…

Descriptors: Foreign Countries, Secondary School Students, Reading Tests, Test Items

The Role of PS Ability and RC Skill in Predicting Growth Trajectories of Mathematics Achievement

Peer reviewed

Vista, Alvin – Cogent Education, 2016

There are relatively few studies in Australia and South-East Asian region that combine investigating models of math growth trajectories with predictors such as reasoning ability and reading comprehension skills. Math achievement is one of the major components of overall academic achievement and it is important to determine what factors (especially…

Descriptors: Foreign Countries, Problem Solving, Reading Comprehension, Predictor Variables

How Do Raters Judge Spoken Vocabulary?

Peer reviewed

Li, Hui – English Language Teaching, 2016

The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Assessing Dimensionality of Noncompensatory Multidimensional Item Response Theory with Complex Structures

Peer reviewed

Svetina, Dubravka – Educational and Psychological Measurement, 2013

The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…

Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length

A Comparative Study of the Variables Used to Measure Syntactic Complexity and Accuracy in Task-Based Research

Peer reviewed

Inoue, Chihiro – Language Learning Journal, 2016

The constructs of complexity, accuracy and fluency (CAF) have been used extensively to investigate learner performance on second language tasks. However, a serious concern is that the variables used to measure these constructs are sometimes used conventionally without any empirical justification. It is crucial for researchers to understand how…

Descriptors: Comparative Analysis, Syntax, Accuracy, Task Analysis

When Can Subscores Be Expected to Have Added Value? Results from Operational and Simulated Data. Research Report. ETS RR-10-16

Sinharay, Sandip – Educational Testing Service, 2010

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008) suggested a method based on classical test theory to determine whether subscores have added value over total scores. This paper provides a literature review and reports when subscores were found to have added value for…

Descriptors: Scores, Correlation, Reliability, Item Response Theory

Standard Errors and Confidence Intervals from Bootstrapping for Ramsay-Curve Item Response Theory Model Item Parameters

Peer reviewed

Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011

Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…

Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods

Source

Applied Psychological…	4
ETS Research Report Series	3
Educational and Psychological…	2
Online Submission	2
ProQuest LLC	2
Applied Measurement in…	1
Asia Pacific Education Review	1
Cogent Education	1
Educational Assessment	1
Educational Testing Service	1
Electronic Journal of…	1
English Language Teaching	1
Grantee Submission	1
International Educational…	1
International Journal of…	1
Journal of Educational…	1
Language Learning Journal	1
Language Testing	1
Malaysian Journal of Learning…	1
Practical Assessment,…	1
Psychological Assessment	1
Structural Equation Modeling:…	1

Author

de la Torre, Jimmy	3
Svetina, Dubravka	2
Abu Kassim, Noor Lide	1
An, Min	1
Atar, Hakan Yavuz	1
Badrasawi, Kamal J. I.	1
Baghaei, Purya	1
Bauer, Daniel J.	1
Ben Domingue	1
Ben Stenhaug	1
Ben-Porath, Yossef S.	1
Chavez, Oscar	1
Choi, Jaehwa	1
Daud, Nuraihan Mat	1
Douglas, Samantha	1
Fu, Jianbin	1
Graham, John R.	1
Grouws, Douglas A.	1
Gu, Fei	1
Guo, Hongwen	1
Harkness, Allan R.	1
Hong, Yuan	1
Hoyle, Larry	1
Inoue, Chihiro	1
Jiao, Hong	1