ERIC - Search Results

Publication Date

In 2025	4
Since 2024	9
Since 2021 (last 5 years)	58
Since 2016 (last 10 years)	147
Since 2006 (last 20 years)	496

Descriptor

Equated Scores	1113
Test Items	298
Item Response Theory	297
Comparative Analysis	247
Statistical Analysis	233
Test Construction	165
Error of Measurement	143
Test Format	135
Scaling	129
College Entrance Examinations	124
Difficulty Level	119
Scores	117
Achievement Tests	116
Latent Trait Theory	113
Standardized Tests	113
Item Analysis	111
Sample Size	110
Mathematical Models	106
Evaluation Methods	102
Scoring	102
Testing Problems	98
Reading Tests	97
Test Reliability	97
Simulation	95
Raw Scores	94
More ▼

Author

Bianchini, John C.	35
von Davier, Alina A.	34
Dorans, Neil J.	33
Kolen, Michael J.	31
Loret, Peter G.	31
Kim, Sooyeon	26
Moses, Tim	24
Livingston, Samuel A.	22
Holland, Paul W.	20
Puhan, Gautam	20
Liu, Jinghua	19
Hanson, Bradley A.	17
van der Linden, Wim J.	16
Sinharay, Sandip	15
Walker, Michael E.	13
Angoff, William H.	12
Brennan, Robert L.	12
Cook, Linda L.	12
Eignor, Daniel R.	12
Lee, Won-Chan	12
Linn, Robert L.	12
Guo, Hongwen	11
Haberman, Shelby J.	11
Harris, Deborah J.	10
More ▼

Education Level

Higher Education	68
Postsecondary Education	50
Secondary Education	47
Elementary Education	35
Elementary Secondary Education	34
High Schools	26
Middle Schools	22
Junior High Schools	19
Grade 8	18
Grade 4	11
Grade 7	10
Intermediate Grades	10
Grade 6	9
Grade 3	8
Early Childhood Education	7
Grade 5	6
Adult Education	5
Primary Education	5
Grade 1	3
Grade 9	3
Adult Basic Education	2
Grade 10	2
Grade 11	2
Grade 2	2
High School Equivalency…	2
More ▼

Audience

Researchers	45
Practitioners	7
Administrators	1
Policymakers	1
Students	1
Teachers	1

Location

Canada	9
Australia	8
Florida	8
United Kingdom (England)	8
Netherlands	7
New York	7
United States	7
Israel	6
Turkey	6
United Kingdom	6
California	5
Japan	4
Sweden	4
Texas	4
Delaware	3
Georgia	3
New Jersey	3
Oregon	3
United Kingdom (Wales)	3
Hungary	2
Indonesia	2
Italy	2
Michigan	2
North Carolina	2
Saudi Arabia	2
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	12
No Child Left Behind Act 2001	5
Education Consolidation…	3
Hawkins Stafford Act 1988	1
Race to the Top	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 316 to 330 of 1,113 results Save | Export

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Direct link

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A. – ETS Research Report Series, 2012

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…

Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling

Linking Parameter Estimates Derived from an Item Response Model through Separate Calibrations. Research Report. ETS RR-09-40

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2009

A regression procedure is developed to link simultaneously a very large number of item response theory (IRT) parameter estimates obtained from a large number of test forms, where each form has been separately calibrated and where forms can be linked on a pairwise basis by means of common items. An application is made to forms in which a…

Descriptors: Regression (Statistics), Item Response Theory, Models, Equated Scores

An Evaluation of Kernel Equating: Parallel Equating with Classical Methods in the SAT Subject Tests[TM] Program. Research Report. ETS RR-09-06

Download full text

Grant, Mary C.; Zhang, Lilly; Damiano, Michele – Educational Testing Service, 2009

This study investigated kernel equating methods by comparing these methods to operational equatings for two tests in the SAT Subject Tests[TM] program. GENASYS (ETS, 2007) was used for all equating methods and scaled score kernel equating results were compared to Tucker, Levine observed score, chained linear, and chained equipercentile equating…

Descriptors: Equated Scores, Methods, Comparative Analysis, College Entrance Examinations

Evaluation of Two New Smoothing Methods in Equating: The Cubic B-Spline Presmoothing Method and the Direct Presmoothing Method

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Journal of Educational Measurement, 2009

This article considers two new smoothing methods in equipercentile equating, the cubic B-spline presmoothing method and the direct presmoothing method. Using a simulation study, these two methods are compared with established methods, the beta-4 method, the polynomial loglinear method, and the cubic spline postsmoothing method, under three sample…

Descriptors: Equated Scores, Methods, Sample Size, Test Content

A Comparison of Statistical Significance Tests for Selecting Equating Functions

Peer reviewed

Direct link

Moses, Tim – Applied Psychological Measurement, 2009

This study compared the accuracies of nine previously proposed statistical significance tests for selecting identity, linear, and equipercentile equating functions in an equivalent groups equating design. The strategies included likelihood ratio tests for the loglinear models of tests' frequency distributions, regression tests, Kolmogorov-Smirnov…

Descriptors: Statistical Significance, Equated Scores, Comparative Analysis, Tests

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

Examining Two Strategies to Link Mixed-Format Tests Using Multiple-Choice Anchors. Research Report. ETS RR-10-18

Download full text

Walker, Michael E.; Kim, Sooyeon – Educational Testing Service, 2010

This study examined the use of an all multiple-choice (MC) anchor for linking mixed format tests containing both MC and constructed-response (CR) items, in a nonequivalent groups design. An MC-only anchor could effectively link two such test forms if either (a) the MC and CR portions of the test measured the same construct, so that the MC anchor…

Descriptors: Equated Scores, Test Format, Multiple Choice Tests, Statistical Analysis

Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

Download full text

Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010

This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

Descriptors: Equated Scores, Scoring, Responses, Test Items

A Single Population Litmus Test for Linear Scale Alignment Methods: Commentary on Kane, Mroch, Suh, and Ripkey

Peer reviewed

Direct link

Dorans, Neil J. – Measurement: Interdisciplinary Research and Perspectives, 2010

Kane, Mroch, Suh, and Ripkey (2009) describe what they call five linear equating methods for the nonequivalent groups with anchor test (NEAT) design. The authors embed these methods within a two-dimensional framework. The first dimension contrasts what the authors call a parameter substitution (PS) approach what they call a chained linear…

Descriptors: Measures (Individuals), Equated Scores, Item Response Theory, Predictor Variables

Comparisons among Small Sample Equating Methods in a Common-Item Design

Peer reviewed

Direct link

Kim, Sooyeon; Livingston, Samuel A. – Journal of Educational Measurement, 2010

Score equating based on small samples of examinees is often inaccurate for the examinee populations. We conducted a series of resampling studies to investigate the accuracy of five methods of equating in a common-item design. The methods were chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating,…

Descriptors: Equated Scores, Test Items, Item Sampling, Item Response Theory

Evidence-Centered Assessment Design and the Advanced Placement Program[R]: A Psychometrician's Perspective

Peer reviewed

Direct link

Brennan, Robert L. – Applied Measurement in Education, 2010

This paper provides an overview of evidence-centered assessment design (ECD) and some general information about of the Advanced Placement (AP[R]) Program. Then the papers in this special issue are discussed, as they relate to the use of ECD in the revision of various AP tests. This paper concludes with some observations about the need to validate…

Descriptors: Advanced Placement Programs, Equivalency Tests, Evidence, Test Construction

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

Limits on the Accuracy of Linking. Research Report. ETS RR-10-22

Download full text

Haberman, Shelby J. – Educational Testing Service, 2010

Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable inking accuracy. To illustrate results, a variety of…

Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy

A Comparison of Anchor-Item Designs for the Concurrent Calibration of Large Banks of Likert-Type Items

Peer reviewed

Direct link

Garcia-Perez, Miguel A.; Alcala-Quintana, Rocio; Garcia-Cueto, Eduardo – Applied Psychological Measurement, 2010

Current interest in measuring quality of life is generating interest in the construction of computerized adaptive tests (CATs) with Likert-type items. Calibration of an item bank for use in CAT requires collecting responses to a large number of candidate items. However, the number is usually too large to administer to each subject in the…

Descriptors: Comparative Analysis, Test Items, Equated Scores, Item Banks

« Previous Page | Next Page »

Pages: 1 | ... | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | ... | 75

Journal of Educational…	108
ETS Research Report Series	78
Applied Psychological…	69
Applied Measurement in…	55
Educational and Psychological…	43
Measurement:…	26
Educational Measurement:…	25
ProQuest LLC	25
Educational Testing Service	23
Journal of Educational and…	17
International Journal of…	13
Journal of Educational…	13
Practical Assessment,…	10
Psychometrika	10
College Board	8
ACT, Inc.	6
Educational Assessment	6
Journal of Experimental…	6
Online Submission	6
Studies in Educational…	6
College Entrance Examination…	5
Journal of Applied Measurement	5
Assessment in Education:…	4
International Journal of…	4
International Journal of…	4
More ▼

Journal Articles	596
Reports - Research	587
Reports - Evaluative	284
Speeches/Meeting Papers	201
Numerical/Quantitative Data	82
Reports - Descriptive	77
Opinion Papers	33
Dissertations/Theses -…	27
Information Analyses	24
Guides - Non-Classroom	15
Tests/Questionnaires	10
Collected Works - General	7
Guides - General	5
Reports - General	5
Books	4
Collected Works - Proceedings	4
Collected Works - Serials	3
Reference Materials -…	3
Book/Product Reviews	2
Guides - Classroom - Learner	2
Dissertations/Theses	1
Guides - Classroom - Teacher	1
Historical Materials	1
Legal/Legislative/Regulatory…	1
Non-Print Media	1
More ▼

SAT (College Admission Test)	73
Iowa Tests of Basic Skills	48
California Achievement Tests	43
Comprehensive Tests of Basic…	43
Metropolitan Achievement Tests	37
Sequential Tests of…	37
Stanford Achievement Tests	37
SRA Achievement Series	35
National Assessment of…	23
Graduate Record Examinations	20
ACT Assessment	18
Advanced Placement…	15
Law School Admission Test	13
Armed Services Vocational…	11
Gates MacGinitie Reading Tests	10
Test of English as a Foreign…	9
Program for International…	8
Preliminary Scholastic…	7
College Board Achievement…	6
Trends in International…	6
General Aptitude Test Battery	5
General Educational…	5
Graduate Management Admission…	5
National Merit Scholarship…	5
Wechsler Intelligence Scale…	5
More ▼