Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 29 |
Descriptor
Correlation | 30 |
Data Analysis | 30 |
Item Response Theory | 30 |
Models | 12 |
Evaluation Methods | 10 |
Scores | 8 |
Foreign Countries | 7 |
Sample Size | 7 |
Simulation | 7 |
Comparative Analysis | 6 |
Monte Carlo Methods | 6 |
More ▼ |
Source
Author
de la Torre, Jimmy | 3 |
Svetina, Dubravka | 2 |
Abu Kassim, Noor Lide | 1 |
An, Min | 1 |
Atar, Hakan Yavuz | 1 |
Badrasawi, Kamal J. I. | 1 |
Baghaei, Purya | 1 |
Bauer, Daniel J. | 1 |
Ben Domingue | 1 |
Ben Stenhaug | 1 |
Ben-Porath, Yossef S. | 1 |
More ▼ |
Publication Type
Journal Articles | 23 |
Reports - Research | 20 |
Reports - Evaluative | 6 |
Dissertations/Theses -… | 2 |
Speeches/Meeting Papers | 2 |
Collected Works - Proceedings | 1 |
Information Analyses | 1 |
Tests/Questionnaires | 1 |
Education Level
Secondary Education | 5 |
Higher Education | 4 |
Elementary Secondary Education | 3 |
Postsecondary Education | 3 |
Elementary Education | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Adult Education | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
High Schools | 1 |
More ▼ |
Audience
Researchers | 1 |
Location
Australia | 1 |
Finland | 1 |
France | 1 |
Iran | 1 |
Japan | 1 |
Malaysia | 1 |
South Korea | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
Program for International… | 2 |
Iowa Tests of Educational… | 1 |
Minnesota Multiphasic… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Ben Stenhaug; Ben Domingue – Grantee Submission, 2022
The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…
Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models
Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022
This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…
Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages
Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017
In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…
Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods
Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min – Practical Assessment, Research & Evaluation, 2016
This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…
Descriptors: Visualization, Item Response Theory, Sample Size, Correlation
Guo, Hongwen – ETS Research Report Series, 2017
Data collected from online learning and tutoring systems for individual students showed strong autocorrelation or dependence because of content connection, knowledge-based dependency, or persistence of learning behavior. When the response data show little dependence or negative autocorrelations for individual students, it is suspected that…
Descriptors: Data Collection, Electronic Learning, Intelligent Tutoring Systems, Information Utilization
Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018
Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…
Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Fu, Jianbin – ETS Research Report Series, 2016
The multidimensional item response theory (MIRT) models with covariates proposed by Haberman and implemented in the "mirt" program provide a flexible way to analyze data based on item response theory. In this report, we discuss applications of the MIRT models with covariates to longitudinal test data to measure skill differences at the…
Descriptors: Item Response Theory, Longitudinal Studies, Test Bias, Goodness of Fit
Badrasawi, Kamal J. I.; Abu Kassim, Noor Lide; Daud, Nuraihan Mat – Malaysian Journal of Learning and Instruction, 2017
Purpose: The study sought to determine the hierarchical nature of reading skills. Whether reading is a "unitary" or "multi-divisible" skill is still a contentious issue. So is the hierarchical order of reading skills. Determining the hierarchy of reading skills is challenging as item difficulty is greatly influenced by factors…
Descriptors: Foreign Countries, Secondary School Students, Reading Tests, Test Items
Vista, Alvin – Cogent Education, 2016
There are relatively few studies in Australia and South-East Asian region that combine investigating models of math growth trajectories with predictors such as reasoning ability and reading comprehension skills. Math achievement is one of the major components of overall academic achievement and it is important to determine what factors (especially…
Descriptors: Foreign Countries, Problem Solving, Reading Comprehension, Predictor Variables
Li, Hui – English Language Teaching, 2016
The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making
Svetina, Dubravka – Educational and Psychological Measurement, 2013
The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…
Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length
Inoue, Chihiro – Language Learning Journal, 2016
The constructs of complexity, accuracy and fluency (CAF) have been used extensively to investigate learner performance on second language tasks. However, a serious concern is that the variables used to measure these constructs are sometimes used conventionally without any empirical justification. It is crucial for researchers to understand how…
Descriptors: Comparative Analysis, Syntax, Accuracy, Task Analysis
Sinharay, Sandip – Educational Testing Service, 2010
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008) suggested a method based on classical test theory to determine whether subscores have added value over total scores. This paper provides a literature review and reports when subscores were found to have added value for…
Descriptors: Scores, Correlation, Reliability, Item Response Theory
Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011
Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…
Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods
Previous Page | Next Page ยป
Pages: 1 | 2