ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Correlation	8
Difficulty Level	8
Error of Measurement	8
Test Items	8
Item Response Theory	5
Factor Analysis	4
Comparative Analysis	3
Monte Carlo Methods	3
Sample Size	3
Simulation	3
Computation	2
Computer Software	2
Equated Scores	2
Evaluation Methods	2
Foreign Countries	2
Item Analysis	2
Models	2
Sampling	2
Science Tests	2
Statistical Analysis	2
Statistical Bias	2
Test Bias	2
Accuracy	1
Achievement Tests	1
Classification	1
More ▼

Source

Applied Psychological…	1
ETS Research Report Series	1
Educational and Psychological…	1
International Journal of…	1
Research Matters	1
Research Papers in Education	1
Structural Equation Modeling:…	1

Author

Ahn, Soyeon	1
Anwyll, Steve	1
Bramley, Tom	1
DeMars, Christine E.	1
Dirlik, Ezgi Mor	1
Finch, Holmes	1
Glanville, Matthew	1
He, Qingping	1
Holland, Paul	1
Jones, Patricia B.	1
Opposs, Dennis	1
Park, Sung Eun	1
Sinharay, Sandip	1
Zopluoglu, Cengiz	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	7
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	1
Elementary Secondary Education	1

Audience

Researchers

Location

United Kingdom (England)

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Comparing Small-Sample Equating with Angoff Judgement for Linking Cut-Scores on Two Tests

Download full text

Bramley, Tom – Research Matters, 2020

The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…

Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Direct link

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

The Comparison of Item Parameters Estimated from Parametric and Nonparametric Item Response Theory Models in Case of the Violance of Local Independence Assumption

Peer reviewed
PDF on ERIC

Download full text

Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019

Item response theory (IRT) has so many advantages than its precedent Classical Test Theory (CTT) such as non-changing item parameters, ability parameter estimations free from the items. However, in order to get these advantages, some assumptions should be met and they are; unidimensionality, normality and local independence. However, it is not…

Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models

An Investigation of Measurement Invariance of the Key Stage 2 National Curriculum Science Sampling Test in England

Peer reviewed

Direct link

He, Qingping; Anwyll, Steve; Glanville, Matthew; Opposs, Dennis – Research Papers in Education, 2014

Since 2010, the whole national cohort Key Stage 2 (KS2) National Curriculum test in science in England has been replaced with a sampling test taken by pupils at the age of 11 from a nationally representative sample of schools annually. The study reported in this paper compares the performance of different subgroups of the samples (classified by…

Descriptors: National Curriculum, Sampling, Foreign Countries, Factor Analysis

A Comparison of Limited-Information and Full-Information Methods in M"plus" for Estimating Item Response Theory Parameters for Nonnormal Populations

Peer reviewed

Direct link

DeMars, Christine E. – Structural Equation Modeling: A Multidisciplinary Journal, 2012

In structural equation modeling software, either limited-information (bivariate proportions) or full-information item parameter estimation routines could be used for the 2-parameter item response theory (IRT) model. Limited-information methods assume the continuous variable underlying an item response is normally distributed. For skewed and…

Descriptors: Item Response Theory, Structural Equation Models, Computation, Computer Software

Item Parameter Estimation for the MIRT Model: Bias and Precision of Confirmatory Factor Analysis-Based Models

Peer reviewed

Direct link

Finch, Holmes – Applied Psychological Measurement, 2010

The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context is one that has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…

Descriptors: Item Response Theory, Computation, Factor Analysis, Models

Choice of Anchor Test in Equating. Research Report. ETS RR-06-35

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006

It is a widely held belief that anchor tests should be miniature versions (i.e., minitests), with respect to content and statistical characteristics of the tests being equated. This paper examines the foundations for this belief. It examines the requirement of statistical representativeness of anchor tests that are content representative. The…

Descriptors: Test Items, Equated Scores, Evaluation Methods, Difficulty Level

Dimensionality Assessment for Dichotomously Scored Items Using Multidimensional Scaling.

Download full text

Jones, Patricia B.; And Others – 1987

In order to determine the effectiveness of multidimensional scaling (MDS) in recovering the dimensionality of a set of dichotomously-scored items, data were simulated in one, two, and three dimensions for a variety of correlations with the underlying latent trait. Similarity matrices were constructed from these data using three margin-sensitive…

Descriptors: Cluster Analysis, Correlation, Difficulty Level, Error of Measurement