Showing 1 to 15 of 35 results
Peer reviewed
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
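For readers unfamiliar with the third method named in this abstract, the following is a minimal sketch of a logistic-regression DIF check in the usual Swaminathan-Rogers style: nested logistic models are compared with likelihood-ratio tests for uniform and non-uniform DIF. The data, effect sizes, and variable names are invented for illustration and are not the article's own materials.

```python
# Hypothetical illustration of logistic-regression DIF screening for one item.
import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

rng = np.random.default_rng(0)
n = 2000
group = rng.integers(0, 2, n)            # 0 = reference group, 1 = focal group
theta = rng.normal(0, 1, n)              # latent ability
# Simulate one studied item with uniform DIF against the focal group (assumed values)
p = 1 / (1 + np.exp(-(1.2 * theta - 0.3 - 0.5 * group)))
item = rng.binomial(1, p)
total = theta                             # stand-in for the matching total score

def loglik(X, y):
    """Fit a logistic regression and return its log-likelihood."""
    return sm.Logit(y, sm.add_constant(X)).fit(disp=0).llf

ll_base    = loglik(np.column_stack([total]), item)                        # matching score only
ll_uniform = loglik(np.column_stack([total, group]), item)                 # + group
ll_nonunif = loglik(np.column_stack([total, group, total * group]), item)  # + interaction

# Likelihood-ratio chi-square tests (1 df each)
print("uniform DIF p =", chi2.sf(2 * (ll_uniform - ll_base), df=1))
print("non-uniform DIF p =", chi2.sf(2 * (ll_nonunif - ll_uniform), df=1))
```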
Peer reviewed
PDF on ERIC
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
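As a point of reference for the reliability estimators whose deflation this article quantifies, here is a minimal sketch of one common estimator, coefficient alpha, computed on simulated dichotomous responses. The simulated data and item counts are illustrative assumptions, not drawn from the article's 1,440 datasets.

```python
# Hypothetical illustration: coefficient alpha on a simulated persons-by-items matrix.
import numpy as np

def coefficient_alpha(scores: np.ndarray) -> float:
    """Cronbach's alpha for a persons-by-items score matrix."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)
    total_var = scores.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_vars.sum() / total_var)

rng = np.random.default_rng(1)
theta = rng.normal(size=(500, 1))
# 10 dichotomous items of varying difficulty driven by a common trait
probs = 1 / (1 + np.exp(-(theta - np.linspace(-1.5, 1.5, 10))))
X = rng.binomial(1, probs)
print(round(coefficient_alpha(X), 3))
```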
Peer reviewed
Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023
Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…
Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)
Peer reviewed
PDF on ERIC
Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022
This study aims to analyze differential bundle functioning in multidimensional tests, with the specific purpose of detecting this effect by varying the location of the DIF item in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…
Descriptors: Correlation, Sample Size, Test Items, Item Analysis
Peer reviewed
PDF on ERIC
Ahmet Yildirim; Nizamettin Koç – International Journal of Assessment Tools in Education, 2024
The present research aims to examine whether the questions in the Programme for International Student Assessment (PISA) 2009 reading literacy instrument display differential item functioning (DIF) among the Turkish, French, and American samples, based on univariate and multivariate matching techniques before and after the total score, which is…
Descriptors: Test Items, Item Analysis, Correlation, Error of Measurement
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy achieved with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
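The small-sample method named in this abstract can be illustrated with a minimal sketch of chained linear equating through an anchor test: a linear equating function maps form X onto the anchor scale in one group, a second maps the anchor onto form Y in the other group, and the two are composed. The score distributions and cut-score below are invented assumptions, not the study's data.

```python
# Hypothetical illustration of chained linear equating of a cut-score.
import numpy as np

def linear_fn(mu_from, sd_from, mu_to, sd_to):
    """Linear equating function mapping one score scale onto another."""
    return lambda x: mu_to + (sd_to / sd_from) * (x - mu_from)

rng = np.random.default_rng(2)
# Group 1 takes form X plus anchor V; group 2 takes form Y plus anchor V (simulated scores)
x1, v1 = rng.normal(30, 6, 200), rng.normal(15, 3, 200)
y2, v2 = rng.normal(28, 5, 200), rng.normal(14, 3, 200)

x_to_v = linear_fn(x1.mean(), x1.std(ddof=1), v1.mean(), v1.std(ddof=1))
v_to_y = linear_fn(v2.mean(), v2.std(ddof=1), y2.mean(), y2.std(ddof=1))

cut_on_x = 33
print("equated cut-score on form Y:", round(v_to_y(x_to_v(cut_on_x)), 2))
```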
Peer reviewed
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Peer reviewed
Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021
In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…
Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests
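The two common treatments of not-reached items that this abstract contrasts can be shown in a minimal sketch: scoring them as incorrect versus ignoring them as not administered. The response patterns below are invented for illustration.

```python
# Hypothetical illustration of scoring with not-reached items.
import numpy as np

# 1 = correct, 0 = incorrect, np.nan = not reached (test ended early)
responses = np.array([
    [1, 1, 0, 1, np.nan, np.nan],   # examinee who ran out of time
    [1, 0, 1, 1, 1, 0],             # examinee who finished the test
])

# Treatment 1: not-reached items count as incorrect
as_incorrect = np.nan_to_num(responses, nan=0.0).sum(axis=1)

# Treatment 2: not-reached items treated as not administered
reached = ~np.isnan(responses)
proportion_correct = np.nansum(responses, axis=1) / reached.sum(axis=1)

print("sum scores, not-reached = incorrect:", as_incorrect)
print("proportion correct on reached items:", np.round(proportion_correct, 2))
```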
Peer reviewed
Aksu Dunya, Beyza – International Journal of Testing, 2018
This study was conducted to analyze potential item parameter drift (IPD) impact on person ability estimates and classification accuracy when drift affects an examinee subgroup. Using a series of simulations, three factors were manipulated: (a) percentage of IPD items in the CAT exam, (b) percentage of examinees affected by IPD, and (c) item pool…
Descriptors: Adaptive Testing, Classification, Accuracy, Computer Assisted Testing
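A minimal sketch in the spirit of the item parameter drift (IPD) simulation described here: a subset of Rasch item difficulties drifts, but abilities are scored with the original parameters, which biases the ability estimates. The drift size, item counts, and EAP scoring routine are illustrative assumptions rather than the study's design (the study also involves adaptive item selection, which is omitted for brevity).

```python
# Hypothetical illustration of ability-estimate bias caused by item parameter drift.
import numpy as np

rng = np.random.default_rng(3)
n_items, n_persons = 40, 1000
b_original = rng.normal(0, 1, n_items)
b_drifted = b_original.copy()
b_drifted[:8] += 0.5                      # 20% of items drift harder by 0.5 logits

theta_true = rng.normal(0, 1, n_persons)
p = 1 / (1 + np.exp(-(theta_true[:, None] - b_drifted[None, :])))
resp = rng.binomial(1, p)

def eap(responses, b, nodes=np.linspace(-4, 4, 81)):
    """EAP ability estimates under a Rasch model with fixed difficulties b."""
    prior = np.exp(-0.5 * nodes**2)                              # standard normal prior (unnormalized)
    pq = 1 / (1 + np.exp(-(nodes[:, None] - b[None, :])))        # nodes x items
    like = np.prod(np.where(responses[:, None, :] == 1, pq, 1 - pq), axis=2)
    post = like * prior
    return (post * nodes).sum(axis=1) / post.sum(axis=1)

# Scoring with the original (undrifted) parameters underestimates ability here
bias = eap(resp, b_original) - theta_true
print("mean bias when scoring with undrifted parameters:", round(bias.mean(), 3))
```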
Peer reviewed
Silber, Henning; Roßmann, Joss; Gummer, Tobias – International Journal of Social Research Methodology, 2018
In this article, we present the results of three question design experiments on inter-item correlations, which tested a grid design against a single-item design. The first and second experiments examined the inter-item correlations of a set with five and seven items, respectively, and the third experiment examined the impact of the question design…
Descriptors: Foreign Countries, Online Surveys, Experiments, Correlation
Peer reviewed
Lee, Woo-yeol; Cho, Sun-Joo – Journal of Educational Measurement, 2017
Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…
Descriptors: Test Items, Item Response Theory, Item Analysis, Simulation
Peer reviewed
PDF on ERIC
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions, and the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize tests in educational policies and quality assurance, their validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Kim, Weon H. – ProQuest LLC, 2017
The purpose of the present study is to apply the item response theory (IRT) and testlet response theory (TRT) models to a reading comprehension test. This study applied the TRT models and the traditional IRT model to a seventh-grade reading comprehension test (n = 8,815) with eight testlets. These three models were compared to determine the best…
Descriptors: Item Response Theory, Test Items, Correlation, Reading Tests
Peer reviewed
PDF on ERIC
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019
Item response theory (IRT) has many advantages over its predecessor, Classical Test Theory (CTT), such as invariant item parameters and ability estimates that do not depend on the particular items administered. However, in order to obtain these advantages, certain assumptions should be met: unidimensionality, normality, and local independence. However, it is not…
Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models
Peer reviewed
PDF on ERIC
Kogar, Esin Yilmaz; Kelecioglu, Hülya – Journal of Education and Learning, 2017
The purpose of this research is first to estimate the item and ability parameters, and the standard error values related to those parameters, obtained from Unidimensional Item Response Theory (UIRT), bifactor (BIF), and Testlet Response Theory (TRT) models in tests including testlets, when the number of testlets, the number of independent items, and…
Descriptors: Item Response Theory, Models, Mathematics Tests, Test Items