ERIC - Search Results

Showing 1 to 15 of 129 results

Anchors Aweigh: How the Choice of Anchor Items Affects the Vertical Scaling of 3PL Data with the Rasch Model

Peer reviewed

Waterbury, Glenn Thomas; DeMars, Christine E. – Educational Assessment, 2021

Vertical scaling is used to put tests of different difficulty onto a common metric. The Rasch model is often used to perform vertical scaling, despite its strict functional form. Few, if any, studies have examined anchor item choice when using the Rasch model to vertically scale data that do not fit the model. The purpose of this study was to…

Descriptors: Test Items, Equated Scores, Item Response Theory, Scaling

Efficient Estimation of Mean Ability Growth Using Vertical Scaling

Peer reviewed

Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021

In recent years, the interest in measuring growth in student ability in various subjects between different grades in school has increased. Therefore, good precision in the estimated growth is of importance. This paper aims to compare estimation methods and test designs when it comes to precision and bias of the estimated growth of mean ability…

Descriptors: Scaling, Ability, Computation, Test Items

The Effect of the Ratio of Common Items and the Separation of Grade Distributions on the Precision of Vertical Scaling

Peer reviewed

Guangming Li; Zhengyan Liang – SAGE Open, 2024

In order to investigate the influence of separation of grade distributions and ratio of common items on the precision of vertical scaling, this simulation study chooses common item design and first grade as base grade. There are four grades with 1,000 students each to take part in a test which has 100 items. Monte Carlo simulation method is used…

Descriptors: Elementary School Students, Grade 1, Grade 2, Grade 3

Rasch versus Classical Equating in the Context of Small Sample Sizes

Peer reviewed

Babcock, Ben; Hodge, Kari J. – Educational and Psychological Measurement, 2020

Equating and scaling in the context of small sample exams, such as credentialing exams for highly specialized professions, has received increased attention in recent research. Investigators have proposed a variety of both classical and Rasch-based approaches to the problem. This study attempts to extend past research by (1) directly comparing…

Descriptors: Item Response Theory, Equated Scores, Scaling, Sample Size

Grouping Effects on Jackknifed Variance Estimation for Item Response Theory Scaling and Equating with Cluster-Based Assessment Data. Research Report. ETS RR-18-16

Peer reviewed

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018

Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…

Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping

An Evaluation of the Single-Group Growth Model as an Alternative to Common-Item Equating. Research Report. ETS RR-16-01

Peer reviewed

Wei, Youhua; Morgan, Rick – ETS Research Report Series, 2016

As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…

Descriptors: Equated Scores, Growth Models, Scaling, Computation

Long-Term Impact of Valid Case Criterion on Capturing Population-Level Growth under Item Response Theory Equating. Research Report. ETS RR-17-17

Peer reviewed

Deng, Weiling; Monfils, Lora – ETS Research Report Series, 2017

Using simulated data, this study examined the impact of different levels of stringency of the valid case inclusion criterion on item response theory (IRT)-based true score equating over 5 years in the context of K-12 assessment when growth in student achievement is expected. Findings indicate that the use of the most stringent inclusion criterion…

Descriptors: Item Response Theory, Equated Scores, True Scores, Educational Assessment

Adapting Accountability Systems to the Limitations of Educational Measurement

Peer reviewed

Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2015

Michael Kane writes in this article that he is in more or less complete agreement with Professor Koretz's characterization of the problem outlined in the paper published in this issue of "Measurement." Kane agrees that current testing practices are not adequate for test-based accountability (TBA) systems, but he writes that he is far…

Descriptors: Educational Testing, Accountability, Standardized Tests, Equated Scores

Psychometric Consequences of Subpopulation Item Parameter Drift

Peer reviewed

Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing

Effect of Adjusting Pseudo-Guessing Parameter Estimates on Test Scaling When Item Parameter Drift Is Present

Peer reviewed

Han, Kyung T.; Wells, Craig S.; Hambleton, Ronald K. – Practical Assessment, Research & Evaluation, 2015

In item response theory test scaling/equating with the three-parameter model, the scaling coefficients A and B have no impact on the c-parameter estimates of the test items since the c-parameter estimates are not adjusted in the scaling/equating procedure. The main research question in this study concerned how serious the consequences would be if…

Descriptors: Item Response Theory, Monte Carlo Methods, Scaling, Test Items

The Long-Term Sustainability of IRT Scaling Methods in Mixed-Format Tests

Peer reviewed

Keller, Lisa A.; Hambleton, Ronald K. – Journal of Educational Measurement, 2013

Due to recent research in equating methodologies indicating that some methods may be more susceptible to the accumulation of equating error over multiple administrations, the sustainability of several item response theory methods of equating over time was investigated. In particular, the paper is focused on two equating methodologies: fixed common…

Descriptors: Item Response Theory, Scaling, Test Format, Equated Scores

A Criterion to Evaluate the Individual Raw-to-Scale Equating Conversions. Research Report. ETS RR-13-05

Peer reviewed

Guo, Hongwen; Puhan, Gautam; Walker, Michael – ETS Research Report Series, 2013

In this study we investigated when an equating conversion line is problematic in terms of gaps and clumps. We suggest using the conditional standard error of measurement (CSEM) to measure the scale scores that are inappropriate in the overall raw-to-scale transformation.

Descriptors: Equated Scores, Test Items, Evaluation Criteria, Error of Measurement

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Technical Manual. The ACT®

ACT, Inc., 2014

This manual contains technical information about the ACT® college readiness assessment. The principal purpose of this manual is to document the technical characteristics of the ACT in light of its intended purposes. ACT regularly conducts research as part of the ongoing formative evaluation of its programs. The research is intended to ensure that…

Descriptors: College Entrance Examinations, College Readiness, Career Readiness, Standards

Software Note: Using BILOG for Fixed-Anchor Item Calibration

Peer reviewed

DeMars, Christine E.; Jurich, Daniel P. – Applied Psychological Measurement, 2012

The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…

Descriptors: Computer Software, Item Response Theory, Scaling, Equated Scores
