ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	15
Since 2006 (last 20 years)	31

Descriptor

Achievement Tests	116
Equated Scores	116
Latent Trait Theory	28
Comparative Analysis	26
Elementary Secondary Education	26
Item Response Theory	26
Test Items	26
Standardized Tests	24
Statistical Analysis	22
Testing Programs	20
Item Analysis	19
Mathematics Tests	19
Scaling	18
Test Interpretation	18
Difficulty Level	17
Test Construction	17
Test Reliability	17
Academic Achievement	15
Test Bias	15
College Entrance Examinations	14
Elementary Education	14
Foreign Countries	14
Testing Problems	14
Mathematical Models	13
Scores	13
More ▼

Publication Type

Reports - Research	70
Journal Articles	42
Speeches/Meeting Papers	32
Reports - Evaluative	17
Numerical/Quantitative Data	9
Reports - Descriptive	9
Information Analyses	4
Guides - Non-Classroom	3
Books	2
Collected Works - General	2
Dissertations/Theses -…	2
Opinion Papers	2
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Reports - General	1
Tests/Questionnaires	1
More ▼

Education Level

Secondary Education	15
Elementary Education	12
Junior High Schools	10
Middle Schools	10
Grade 8	9
Grade 4	6
Elementary Secondary Education	5
Intermediate Grades	5
Early Childhood Education	4
Grade 7	4
Primary Education	4
Grade 3	3
Grade 5	3
Grade 6	3
Higher Education	3
Grade 1	1
High Schools	1
Postsecondary Education	1
More ▼

Audience

Researchers	8
Practitioners	2

Location

Turkey	4
Netherlands	3
New York	3
Florida	2
United Kingdom	2
Alaska	1
California	1
Canada	1
Hawaii	1
Idaho	1
Italy	1
Oregon	1
South Korea	1
Texas (Dallas)	1
United Kingdom (England)	1
United States	1
Washington	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	7
No Child Left Behind Act 2001	2

What Works Clearinghouse Rating

Achievement Tests X

Showing 1 to 15 of 116 results Save | Export

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Direct link

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Multiple Group Item Response Theory Applications Using "Stata irt" Package

Peer reviewed

Direct link

Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021

The purpose of this paper is to briefly introduce two most common applications of multiple group item response theory (IRT) models, namely detecting differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…

Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis

Comparing Performance of Different Equating Methods in Presence and Absence of DIF Items in Anchor Test

Peer reviewed
PDF on ERIC

Download full text

Gübes, Nese; Uyar, Seyma – International Journal of Progressive Education, 2020

This study aims to compare the performance of different small sample equating methods in the presence and absence of differential item functioning (DIF) in common items. In this research, Tucker linear equating, Levine linear equating, unsmoothed and pre-smoothed (C=4) chained equipercentile equating, and simplified circle arc equating methods…

Descriptors: Test Bias, Equated Scores, Test Items, Methods

English MAP Reading Fluency Technical Report: Based on Assessments Administered during the 2020-2021 School Year

Download full text

NWEA, 2022

This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…

Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency

A Comparison of Kernel Equating Methods Based on Neat Design

Peer reviewed
PDF on ERIC

Download full text

Akin Arikan, Cigdem – Eurasian Journal of Educational Research, 2019

Problem Statement: Equating can be defined as a statistical process that allows modifying the differences between test forms with similar content and difficulty so that the scores obtained from these forms can be used interchangeably. In the literature, there are many equating methods, one of which is Kernel equating. Trends in International…

Descriptors: Equated Scores, Foreign Countries, Achievement Tests, International Assessment

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale

Peer reviewed
PDF on ERIC

Download full text

Direct link

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…

Descriptors: Equated Scores, Validity, Methods, School Districts

Linking TIMSS and NAEP Assessments to Evaluate International Trends in Achievement

Peer reviewed
PDF on ERIC

Download full text

Lim, Hwanggyu; Sireci, Stephen G. – Education Policy Analysis Archives, 2017

The Trends in International Mathematics and Science Study (TIMSS) makes it possible to compare the performance of students in the US in Mathematics and Science to the performance of students in other countries. TIMSS uses four international benchmarks for describing student achievement: Low, Intermediate, High, and Advanced. In this study, we…

Descriptors: Achievement Tests, Mathematics Achievement, Mathematics Tests, International Assessment

Evaluating the 'Similar Items Method' for Standard Maintaining. Conference Paper

Direct link

Bramley, Tom – Cambridge Assessment, 2018

The aim of the research reported here was to get some idea of the accuracy of grade boundaries (cut-scores) obtained by applying the 'similar items method' described in Bramley & Wilson (2016). In this method experts identify items on the current version of a test that are sufficiently similar to items on previous versions for them to be…

Descriptors: Accuracy, Cutting Scores, Test Items, Item Analysis

Equating TIMSS Mathematics Subtests with Nonlinear Equating Methods Using NEAT Design: Circle-Arc Equating Approaches

Peer reviewed
PDF on ERIC

Download full text

Ozdemir, Burhanettin – International Journal of Progressive Education, 2017

The purpose of this study is to equate Trends in International Mathematics and Science Study (TIMSS) mathematics subtest scores obtained from TIMSS 2011 to scores obtained from TIMSS 2007 form with different nonlinear observed score equating methods under Non-Equivalent Anchor Test (NEAT) design where common items are used to link two or more test…

Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, International Assessment

The Effect of Differential Item Functioning in Anchor Items on Population Invariance of Equating

Peer reviewed

Direct link

Huggins, Anne Corinne – Educational and Psychological Measurement, 2014

Invariant relationships in the internal mechanisms of estimating achievement scores on educational tests serve as the basis for concluding that a particular test is fair with respect to statistical bias concerns. Equating invariance and differential item functioning are both concerned with invariant relationships yet are treated separately in the…

Descriptors: Test Bias, Test Items, Equated Scores, Achievement Tests

Psychometric Consequences of Subpopulation Item Parameter Drift

Peer reviewed

Direct link

Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing

Explaining Variation in Findings from Efficacy and Effectiveness Studies for English Reading Interventions for English Learners

Peer reviewed

Direct link

Barr, Christopher D.; Reutebuch, Colleen K.; Carlson, Coleen D.; Vaughn, Sharon; Francis, David J. – Journal of Research on Educational Effectiveness, 2019

Beginning in 2002, researchers developed, implemented, and evaluated the efficacy of an English reading intervention for first-grade English learners using multiple randomized control trials (RCTs). As a result of this efficacy work, researchers successfully competed for an IES Goal 4 effectiveness study using the same intervention. Unlike the…

Descriptors: Intervention, English Language Learners, Grade 1, Elementary School Students

Linking U.S. School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017

There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…

Descriptors: School Districts, Scores, Statistical Distributions, Database Design

Language Effects in International Testing: The Case of PISA 2006 Science Items

Peer reviewed

Direct link

El Masri, Yasmine H.; Baird, Jo-Anne; Graesser, Art – Assessment in Education: Principles, Policy & Practice, 2016

We investigate the extent to which language versions (English, French and Arabic) of the same science test are comparable in terms of item difficulty and demands. We argue that language is an inextricable part of the scientific literacy construct, be it intended or not by the examiner. This argument has considerable implications on methodologies…

Descriptors: International Assessment, Difficulty Level, Test Items, Language Variation

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Journal of Educational…	10
Applied Psychological…	8
Educational Measurement:…	3
Educational and Psychological…	3
New York State Education…	3
Studies in Educational…	3
Applied Measurement in…	2
Assessment in Education:…	2
International Journal of…	2
ProQuest LLC	2
Cambridge Assessment	1
ETS Research Report Series	1
Education Digest: Essential…	1
Education Policy Analysis…	1
Educational Sciences: Theory…	1
Eurasian Journal of…	1
Evaluation Review	1
Int Rev Educ	1
Journal of Educational and…	1
Journal of Research and…	1
Journal of Research on…	1
Large-scale Assessments in…	1
Measurement:…	1
Multivariate Behavioral…	1
NWEA	1
More ▼

Cook, Linda L.	3
Forster, Fred	3
Gallas, Edwin J.	3
Harris, Deborah J.	3
Holmes, Susan E.	3
Hoover, H. D.	3
Kolen, Michael J.	3
Brennan, Robert L.	2
Doron, Rina	2
Engelhard, George, Jr.	2
Green, Donald Ross	2
Ho, Andrew D.	2
Kalogrides, Demetra	2
Linn, Robert L.	2
Loyd, Brenda H.	2
Phillips, S. E.	2
Reardon, Sean F.	2
Sireci, Stephen G.	2
Akin Arikan, Cigdem	1
Anderson, Patricia S.	1
Angoff, William H.	1
Applebaum, Wayne R.	1
Baird, Jo-Anne	1
Barr, Christopher D.	1
More ▼

SAT (College Admission Test)	7
California Achievement Tests	6
Comprehensive Tests of Basic…	6
Iowa Tests of Basic Skills	6
National Assessment of…	5
Trends in International…	5
College Board Achievement…	4
Advanced Placement…	3
Graduate Record Examinations	3
Program for International…	3
Florida Comprehensive…	2
General Educational…	2
Iowa Tests of Educational…	2
Measures of Academic Progress	2
Metropolitan Achievement Tests	2
Stanford Achievement Tests	2
ACT Assessment	1
College Level Examination…	1
Law School Admission Test	1
Sequential Tests of…	1
Test of Standard Written…	1
Woodcock Johnson Psycho…	1
More ▼