Showing 1 to 15 of 167 results
Peer reviewed
Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022
One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…
Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis
Peer reviewed
Puhan, Gautam; Kim, Sooyeon – Journal of Educational Measurement, 2022
As a result of the COVID-19 pandemic, at-home testing has become a popular delivery mode in many testing programs. When programs offer at-home testing to expand their service, the score comparability between test takers testing remotely and those testing in a test center is critical. This article summarizes statistical procedures that could be…
Descriptors: Scores, Scoring, Comparative Analysis, Testing
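The specific procedures summarized in this entry are truncated above; as one simple point of reference for mode-comparability checks, the sketch below computes a standardized mean difference (Cohen's d) between at-home and test-center scores. The data and group labels are hypothetical, not taken from the article.

```python
import math

def cohens_d(scores_a, scores_b):
    """Standardized mean difference between two score distributions (pooled SD)."""
    n_a, n_b = len(scores_a), len(scores_b)
    mean_a, mean_b = sum(scores_a) / n_a, sum(scores_b) / n_b
    var_a = sum((x - mean_a) ** 2 for x in scores_a) / (n_a - 1)
    var_b = sum((x - mean_b) ** 2 for x in scores_b) / (n_b - 1)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (mean_a - mean_b) / pooled_sd

# Hypothetical scaled scores for the two delivery modes.
at_home = [152, 148, 160, 155, 149, 158, 151, 147]
test_center = [150, 146, 159, 153, 148, 155, 152, 145]
print(round(cohens_d(at_home, test_center), 3))
```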
Benton, Tom – Research Matters, 2020
This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…
Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis
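As a minimal illustration of checking how well expert judgements track empirical difficulty, the sketch below computes a Spearman rank correlation between hypothetical perceived-difficulty ratings and empirical difficulties (proportion incorrect); it is not the analysis used in the article and assumes no tied values.

```python
def spearman_rho(x, y):
    """Spearman rank correlation between two equal-length lists (no tie handling)."""
    def ranks(values):
        order = sorted(range(len(values)), key=lambda i: values[i])
        r = [0] * len(values)
        for rank, idx in enumerate(order, start=1):
            r[idx] = rank
        return r
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))

# Hypothetical data: expert-perceived difficulty vs. empirical proportion incorrect.
perceived = [2.1, 3.4, 1.2, 4.0, 2.8]
empirical = [0.35, 0.65, 0.20, 0.60, 0.45]
print(round(spearman_rho(perceived, empirical), 3))  # 0.9
```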
Peer reviewed
Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021
This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), the likelihood ratio test (LRT), and the Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT, and continuity correction in the MH procedure. This study…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis
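The continuity correction examined in this study enters the Mantel-Haenszel chi-square as the 0.5 subtracted in the numerator. A minimal sketch of that statistic, computed from per-stratum 2x2 tables with hypothetical counts, is given below.

```python
def mh_chi_square(strata, continuity_correction=True):
    """Mantel-Haenszel DIF chi-square from score-level 2x2 tables.

    Each stratum is (a, b, c, d):
      a = reference correct, b = reference incorrect,
      c = focal correct,     d = focal incorrect.
    """
    sum_a = sum_expected = sum_var = 0.0
    for a, b, c, d in strata:
        n_ref, n_foc = a + b, c + d
        m_correct, m_incorrect = a + c, b + d
        n_total = n_ref + n_foc
        sum_a += a
        sum_expected += n_ref * m_correct / n_total
        sum_var += n_ref * n_foc * m_correct * m_incorrect / (n_total ** 2 * (n_total - 1))
    numerator = abs(sum_a - sum_expected)
    if continuity_correction:
        numerator -= 0.5
    return numerator ** 2 / sum_var

# Hypothetical score-level tables for one studied item.
tables = [(40, 10, 30, 20), (35, 15, 28, 22), (45, 5, 38, 12)]
print(round(mh_chi_square(tables), 3))
print(round(mh_chi_square(tables, continuity_correction=False), 3))
```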
Peer reviewed
Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2021
This study investigated whether item position effects lead to DIF when different test booklets are used. To do this, the methods of Lord's chi-square and Raju's unsigned area with the 3PL model were applied, both with and without item purification. When the performance of the methods was compared, it was revealed that…
Descriptors: Item Response Theory, Test Bias, Test Items, Comparative Analysis
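Raju's unsigned area is the area between the reference and focal group item characteristic curves. The sketch below approximates it numerically for two hypothetical 3PL items; note that Raju's closed-form expressions assume equal lower asymptotes, and the article's exact set-up is not reproduced here.

```python
import math

def p3pl(theta, a, b, c, D=1.7):
    """3PL item characteristic curve."""
    return c + (1.0 - c) / (1.0 + math.exp(-D * a * (theta - b)))

def unsigned_area(params_ref, params_focal, lo=-4.0, hi=4.0, steps=2000):
    """Numerically approximate the unsigned area between two ICCs on [lo, hi]."""
    width = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        theta = lo + (i + 0.5) * width
        total += abs(p3pl(theta, *params_ref) - p3pl(theta, *params_focal)) * width
    return total

# Hypothetical (a, b, c) parameters for the reference and focal groups.
print(round(unsigned_area((1.2, 0.0, 0.2), (1.0, 0.4, 0.2)), 3))
```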
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al. (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Peer reviewed
Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021
The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…
Descriptors: Test Norms, Scores, Regression (Statistics), Test Items
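For contrast with the semiparametric approach studied here, the sketch below implements a conventional per-group norming step: a raw score is converted to a within-group percentile rank and then to an area-transformed T score. The norm-group data are hypothetical and no smoothing across groups is attempted.

```python
from statistics import NormalDist

def percentile_rank(raw_score, group_scores):
    """Percentile rank of a raw score within its norm group (mid-rank convention)."""
    below = sum(1 for s in group_scores if s < raw_score)
    equal = sum(1 for s in group_scores if s == raw_score)
    return (below + 0.5 * equal) / len(group_scores)

def t_score(raw_score, group_scores):
    """Area-transformed T score (mean 50, SD 10) from the within-group percentile."""
    p = min(max(percentile_rank(raw_score, group_scores), 0.001), 0.999)
    return 50 + 10 * NormalDist().inv_cdf(p)

# Hypothetical raw scores from one age group of the norm sample.
norm_group = [12, 15, 18, 20, 21, 23, 25, 27, 28, 30, 31, 33]
print(round(t_score(26, norm_group), 1))
```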
Peer reviewed
Arikan, Çigdem Akin – International Journal of Progressive Education, 2018
The main purpose of this study is to compare the performance of test forms equated with a midi anchor test and with a mini anchor test, based on item response theory. The research was conducted using simulated data generated from the Rasch model. In order to equate the two test forms, the anchor item nonequivalent groups (internal anchor test) was…
Descriptors: Equated Scores, Comparative Analysis, Item Response Theory, Tests
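Under the Rasch model, placing a new form on the base form's scale through common anchor items can be as simple as a mean-shift (mean/mean) linking constant. The sketch below shows that step with hypothetical anchor difficulties; it does not reproduce the study's midi versus mini anchor comparison.

```python
def mean_shift(anchor_b_old, anchor_b_new):
    """Mean/mean linking constant under the Rasch model from common anchor items."""
    return (sum(anchor_b_old) / len(anchor_b_old)
            - sum(anchor_b_new) / len(anchor_b_new))

def link_to_old_scale(b_new, shift):
    """Place new-form item difficulties on the old form's scale."""
    return [round(b + shift, 3) for b in b_new]

# Hypothetical anchor-item difficulties estimated separately on each form.
anchor_old = [-0.8, -0.2, 0.1, 0.6, 1.1]
anchor_new = [-0.6, 0.0, 0.3, 0.8, 1.3]
shift = mean_shift(anchor_old, anchor_new)
print(round(shift, 3))                        # linking constant (-0.2)
print(link_to_old_scale([0.5, -1.0], shift))  # two non-anchor items rescaled
```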
Peer reviewed
Debeer, Dries; Ali, Usama S.; van Rijn, Peter W. – Journal of Educational Measurement, 2017
Test assembly is the process of selecting items from an item pool to form one or more new test forms. Often new test forms are constructed to be parallel with an existing (or an ideal) test. Within the context of item response theory, the test information function (TIF) or the test characteristic curve (TCC) are commonly used as statistical…
Descriptors: Test Format, Test Construction, Statistical Analysis, Comparative Analysis
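The two matching targets named here have simple closed forms: the TCC is the sum of item characteristic curves, and the TIF is the sum of item information functions. The sketch below evaluates both for a small hypothetical 3PL item set; it illustrates the quantities, not the assembly method in the article.

```python
import math

def icc_3pl(theta, a, b, c, D=1.7):
    """3PL probability of a correct response."""
    return c + (1.0 - c) / (1.0 + math.exp(-D * a * (theta - b)))

def test_characteristic(theta, items):
    """TCC value: expected number-correct score at theta."""
    return sum(icc_3pl(theta, *item) for item in items)

def test_information(theta, items, D=1.7):
    """TIF value: sum of 3PL item informations at theta."""
    total = 0.0
    for a, b, c in items:
        p = icc_3pl(theta, a, b, c, D)
        total += (D * a) ** 2 * ((1 - p) / p) * ((p - c) / (1 - c)) ** 2
    return total

# Hypothetical pool entries as (a, b, c) triples.
pool = [(1.0, -0.5, 0.2), (1.4, 0.0, 0.15), (0.8, 0.7, 0.25)]
for theta in (-1.0, 0.0, 1.0):
    print(theta, round(test_characteristic(theta, pool), 3),
          round(test_information(theta, pool), 3))
```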
Ayodele, Alicia Nicole – ProQuest LLC, 2017
Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…
Descriptors: Statistical Analysis, Test Bias, Test Items, Scores
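The AC-LOR estimator mentioned here works on the log odds ratio between adjacent response categories for the reference and focal groups. The sketch below computes those ratios from raw category frequencies; it is a stripped-down, unmatched version (real DSF analyses condition on a matching variable such as total score), and the counts are hypothetical.

```python
import math

def adjacent_category_lor(ref_counts, focal_counts):
    """Adjacent-category log odds ratios for a polytomous item.

    ref_counts / focal_counts hold response frequencies per score category
    (index 0 = lowest). Returns one log odds ratio per step.
    """
    lors = []
    for k in range(1, len(ref_counts)):
        a, b = ref_counts[k], ref_counts[k - 1]      # reference: higher vs. lower category
        c, d = focal_counts[k], focal_counts[k - 1]  # focal: higher vs. lower category
        lors.append(math.log((a * d) / (b * c)))
    return lors

# Hypothetical category frequencies for one four-category item.
reference = [30, 45, 60, 40]
focal = [35, 50, 55, 30]
print([round(v, 3) for v in adjacent_category_lor(reference, focal)])
```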
Peer reviewed
Savalei, Victoria; Rhemtulla, Mijke – Journal of Educational and Behavioral Statistics, 2017
In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately…
Descriptors: Computation, Statistical Analysis, Test Items, Maximum Likelihood Statistics
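The composites discussed here are simple linear functions of the raw items. As a reminder of what those composites are, the sketch below forms a sum-based scale score and mean-based parcels from hypothetical item responses; the estimation method proposed in the article is not shown.

```python
def scale_score(item_responses):
    """Scale score as the sum of an examinee's item responses."""
    return sum(item_responses)

def parcels(item_responses, assignment):
    """Parcels as means of disjoint item subsets (one list of item indices per parcel)."""
    return [round(sum(item_responses[i] for i in idx) / len(idx), 3) for idx in assignment]

# Hypothetical responses to nine Likert-type items and a three-parcel assignment.
responses = [3, 4, 2, 5, 4, 3, 2, 4, 5]
assignment = [[0, 3, 6], [1, 4, 7], [2, 5, 8]]
print(scale_score(responses), parcels(responses, assignment))
```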
Peer reviewed
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
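Agreement between an automated rater and a human rater is often summarized with quadratic weighted kappa, one common ingredient of such comparisons (the ranking procedures studied in the article are not reproduced here). A minimal sketch with hypothetical scores:

```python
def quadratic_weighted_kappa(rater_a, rater_b, num_categories):
    """Quadratic weighted kappa between two raters' 0-based integer scores."""
    n = len(rater_a)
    observed = [[0.0] * num_categories for _ in range(num_categories)]
    for x, y in zip(rater_a, rater_b):
        observed[x][y] += 1.0 / n
    marg_a = [sum(row) for row in observed]
    marg_b = [sum(observed[i][j] for i in range(num_categories)) for j in range(num_categories)]
    num = den = 0.0
    for i in range(num_categories):
        for j in range(num_categories):
            weight = (i - j) ** 2 / (num_categories - 1) ** 2
            num += weight * observed[i][j]
            den += weight * marg_a[i] * marg_b[j]
    return 1.0 - num / den

# Hypothetical scores from an automated rater and a human rater on the same responses.
automated = [0, 1, 2, 2, 3, 1, 0, 2, 3, 1]
human = [0, 1, 2, 3, 3, 1, 1, 2, 3, 2]
print(round(quadratic_weighted_kappa(automated, human, num_categories=4), 3))
```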
Peer reviewed
Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017
Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. Three scaling procedures are considered: (a) concurrent…
Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items
Peer reviewed
Zeller, Florian; Krampen, Dorothea; Reiß, Siegbert; Schweizer, Karl – Educational and Psychological Measurement, 2017
The item-position effect describes how an item's position within a test, that is, the number of previously completed items, affects the response to this item. Previously, this effect was represented by constraints reflecting simple courses, for example, a linear increase. Due to the inflexibility of these representations, our aim was to examine…
Descriptors: Goodness of Fit, Simulation, Factor Analysis, Intelligence Tests
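One way such a "simple course" is represented is by fixing the loadings of an item-position factor to a prescribed pattern, for example a linear increase across item positions. The one-liner below generates that fixed loading vector; it is only an illustration of the constraint idea, not of the models compared in the article.

```python
def linear_position_loadings(num_items):
    """Fixed loadings for an item-position factor under a linear-increase constraint."""
    return [round(i / (num_items - 1), 3) for i in range(num_items)]

# e.g., a 10-item test: loadings rise linearly from 0.0 to 1.0 across positions.
print(linear_position_loadings(10))
```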
Peer reviewed
Lee, Wooyeol; Cho, Sun-Joo – Applied Measurement in Education, 2017
Utilizing a longitudinal item response model, this study investigated the effect of item parameter drift (IPD) on item parameters and person scores via a Monte Carlo study. Item parameter recovery was investigated for various IPD patterns in terms of bias and root mean-square error (RMSE), and percentage of time the 95% confidence interval covered…
Descriptors: Item Response Theory, Test Items, Bias, Computation
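Bias and RMSE are the two recovery criteria named in this abstract. A minimal sketch of both, applied to hypothetical Monte Carlo replications of one item-difficulty estimate, is given below.

```python
import math

def bias(estimates, true_value):
    """Mean signed deviation of replicated estimates from the generating value."""
    return sum(e - true_value for e in estimates) / len(estimates)

def rmse(estimates, true_value):
    """Root mean-square error of replicated estimates."""
    return math.sqrt(sum((e - true_value) ** 2 for e in estimates) / len(estimates))

# Hypothetical difficulty estimates for one item across replications.
true_b = 0.50
replicates = [0.47, 0.55, 0.52, 0.44, 0.58, 0.49, 0.53, 0.46]
print(round(bias(replicates, true_b), 4), round(rmse(replicates, true_b), 4))
```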