Publication Date
In 2025: 4
Since 2024: 9
Since 2021 (last 5 years): 58
Since 2016 (last 10 years): 147
Since 2006 (last 20 years): 496
Author
Bianchini, John C.: 35
von Davier, Alina A.: 34
Dorans, Neil J.: 33
Kolen, Michael J.: 31
Loret, Peter G.: 31
Kim, Sooyeon: 26
Moses, Tim: 24
Livingston, Samuel A.: 22
Holland, Paul W.: 20
Puhan, Gautam: 20
Liu, Jinghua: 19
Location
Canada: 9
Australia: 8
Florida: 8
United Kingdom (England): 8
Netherlands: 7
New York: 7
United States: 7
Israel: 6
Turkey: 6
United Kingdom: 6
California: 5
Laws, Policies, & Programs
Elementary and Secondary…: 12
No Child Left Behind Act 2001: 5
Education Consolidation…: 3
Hawkins Stafford Act 1988: 1
Race to the Top: 1
What Works Clearinghouse Rating
Meets WWC Standards without Reservations: 1
Meets WWC Standards with or without Reservations: 1
Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010
Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…
Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques
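As background for the linking procedures the article compares, one of the simplest separate-calibration approaches, mean/sigma linking, can be sketched as follows. The anchor-item difficulty values below are invented for illustration; they are not from the study.

```python
from statistics import mean, stdev

def mean_sigma_link(b_source, b_target):
    # Mean/sigma linking: find the slope A and intercept B that place
    # the source calibration's difficulty estimates on the target scale.
    A = stdev(b_target) / stdev(b_source)
    B = mean(b_target) - A * mean(b_source)
    return A, B

# Hypothetical difficulty estimates for the same anchor items obtained
# from two separately calibrated groups.
b_src = [-1.2, -0.3, 0.4, 1.1]
b_tgt = [-1.0, -0.1, 0.7, 1.4]
A, B = mean_sigma_link(b_src, b_tgt)
b_rescaled = [A * b + B for b in b_src]
```

By construction, the rescaled source difficulties have exactly the target group's mean and standard deviation; concurrent calibration, by contrast, avoids a separate linking step by estimating all groups jointly.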
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – ETS Research Report Series, 2008
This study examined variations of the nonequivalent-groups equating design for mixed-format tests--tests containing both multiple-choice (MC) and constructed-response (CR) items--to determine which design was most effective in producing equivalent scores across the two tests to be equated. Four linking designs were examined: (a) an anchor with…
Descriptors: Equated Scores, Test Format, Multiple Choice Tests, Responses
von Davier, Alina A.; Manalo, Jonathan R.; Rijmen, Frank – ETS Research Report Series, 2008
The standard errors of the two most widely used population-invariance measures of equating functions, the root mean square difference (RMSD) and the root expected mean square difference (REMSD), have not been derived for common equating methods such as linear equating. Consequently, it is unknown how much noise these estimates contain. This paper…
Descriptors: Equated Scores, Error of Measurement, Statistical Analysis, Sampling
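A minimal sketch of the RMSD statistic referred to above, assuming its usual weighted form: subgroup equating functions are compared to the total-group function at a score point and the result is standardized by a reference-form standard deviation (normalization conventions vary). The equating functions, weights, and SD below are invented toy values.

```python
from math import sqrt

def rmsd_at(x, subgroup_eqs, weights, total_eq, sd_y):
    # Root mean square difference between subgroup and total-group
    # equated scores at raw score x, standardized by the SD of the
    # reference-form scores (one common convention).
    terms = [w * (eq(x) - total_eq(x)) ** 2
             for eq, w in zip(subgroup_eqs, weights)]
    return sqrt(sum(terms)) / sd_y

# Toy linear equating functions for two subgroups and the total group.
eq_a = lambda x: 1.02 * x + 0.5
eq_b = lambda x: 0.98 * x - 0.3
eq_total = lambda x: 1.00 * x + 0.1
val = rmsd_at(20, [eq_a, eq_b], [0.5, 0.5], eq_total, sd_y=8.0)
```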
Effect of Repeaters on Score Equating in a Large-Scale Licensure Test. Research Report. ETS RR-09-27
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2009
This study investigated the subgroup invariance of equating functions for a licensure test in the context of a nonequivalent groups with anchor test (NEAT) design. Examinees who had taken a new, to-be-equated form of the test were divided into three subgroups according to their previous testing experience: (a) repeaters who previously took the…
Descriptors: Equated Scores, Licensing Examinations (Professions), Test Construction, Repetition
Goldman, Robert N.; McKenzie, John D. Jr. – Teaching Statistics: An International Journal for Teachers, 2009
We explain how to simulate both univariate and bivariate raw data sets having specified values for common summary statistics. The first example illustrates how to "construct" a data set having prescribed values for the mean and the standard deviation--for a one-sample t test with a specified outcome. The second shows how to create a bivariate data…
Descriptors: Correlation, Equated Scores, Statistical Analysis, Weighted Scores
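The univariate construction the authors describe can be sketched in a few lines: draw arbitrary raw values, standardize them, and rescale so the sample statistics hit the prescribed targets exactly. The target values here are invented for illustration.

```python
import random
from statistics import mean, stdev

def force_mean_sd(data, target_mean, target_sd):
    # Standardize the data, then rescale so the sample mean and sample
    # standard deviation equal the prescribed targets exactly.
    m, s = mean(data), stdev(data)
    return [target_mean + target_sd * (x - m) / s for x in data]

random.seed(1)
raw = [random.gauss(0, 1) for _ in range(30)]
sample = force_mean_sd(raw, target_mean=50.0, target_sd=10.0)
```

The bivariate case in the article additionally controls the correlation; a common route is to regress one standardized variable on the other and rescale the residuals, but that is beyond this sketch.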
Suh, Youngsuk; Mroch, Andrew A.; Kane, Michael T.; Ripkey, Douglas R. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this study, a database containing the responses of 40,000 candidates to 90 multiple-choice questions was used to mimic data sets for 50-item tests under the "nonequivalent groups with anchor test" (NEAT) design. Using these smaller data sets, we evaluated the performance of five linear equating methods for the NEAT design with five levels of…
Descriptors: Test Items, Equated Scores, Methods, Differences
Wells, Craig S.; Baldwin, Su; Hambleton, Ronald K.; Sireci, Stephen G.; Karatonis, Ana; Jirka, Stephen – Applied Measurement in Education, 2009
Score equity assessment is an important analysis to ensure inferences drawn from test scores are comparable across subgroups of examinees. The purpose of the present evaluation was to assess the extent to which the Grade 8 NAEP Math and Reading assessments for 2005 were equivalent across selected states. More specifically, the present study…
Descriptors: National Competency Tests, Test Bias, Equated Scores, Grade 8
Wang, Tianyou; Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article uses simulation to compare two test equating methods under the common-item nonequivalent groups design: the frequency estimation method and the chained equipercentile method. An item response theory model is used to define the true equating criterion, simulate group differences, and generate response data. Three linear equating…
Descriptors: Equated Scores, Item Response Theory, Simulation, Comparative Analysis
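A toy sketch of the chained equipercentile idea compared above: equate the new form to the anchor in one group, then the anchor to the reference form in the other. The percentile-rank and inverse steps are deliberately simplified relative to operational practice (no smoothing, crude inversion), so treat this as an illustration of the chain, not of the full method.

```python
def percentile_rank(scores, x):
    # Proportion of scores at or below x, counting half of the ties at x.
    below = sum(s < x for s in scores)
    at = sum(s == x for s in scores)
    return (below + 0.5 * at) / len(scores)

def equipercentile(x, scores_x, scores_y):
    # Map x to the form-Y score holding approximately the same
    # percentile rank; a crude inverse via the sorted Y scores.
    p = percentile_rank(scores_x, x)
    sorted_y = sorted(scores_y)
    idx = min(int(p * len(sorted_y)), len(sorted_y) - 1)
    return sorted_y[idx]

def chained_equipercentile(x, scores_x, anchor_x, anchor_y, scores_y):
    # Chain: new form X -> anchor (X group), then anchor (Y group) -> Y.
    v = equipercentile(x, scores_x, anchor_x)
    return equipercentile(v, anchor_y, scores_y)
```

The frequency estimation method instead builds a synthetic-population distribution for each form conditional on the anchor and equates within that single population.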
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Journal of Educational Measurement, 2008
This study addressed the sampling error and linking bias that occur with small samples in a nonequivalent groups anchor test design. We proposed a linking method called the synthetic function, which is a weighted average of the identity function and a traditional equating function (in this case, the chained linear equating function). Specifically,…
Descriptors: Equated Scores, Sample Size, Test Reliability, Comparative Analysis
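The synthetic function described in this abstract is simply a weighted average of the identity function and a traditional equating function. A sketch, with a generic linear function standing in for the full chained linear function of the NEAT design (all means, SDs, and the weight are invented):

```python
def linear_equate(x, mean_x, sd_x, mean_y, sd_y):
    # Generic linear equating: match the means and SDs of the two forms.
    # (Stands in for the full chained linear function of the NEAT design.)
    return mean_y + (sd_y / sd_x) * (x - mean_x)

def synthetic(x, w, equating_fn):
    # Weighted average of the identity function and a traditional
    # equating function; w -> 1 shrinks toward no adjustment at all,
    # which trades linking bias for reduced small-sample error.
    return w * x + (1 - w) * equating_fn(x)

eq = lambda x: linear_equate(x, mean_x=30.0, sd_x=6.0, mean_y=32.0, sd_y=6.0)
score = synthetic(20.0, w=0.5, equating_fn=eq)
```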
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
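For reference, the dichotomous Rasch model underlying such scales has a one-line item response function; polytomous extensions such as the partial credit model generalize the same logistic form across adjacent score categories.

```python
from math import exp

def rasch_probability(theta, b):
    # Rasch model: P(correct response | ability theta, item difficulty b).
    return 1.0 / (1.0 + exp(-(theta - b)))
```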
Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet – Pearson, 2012
Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance, whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…
Descriptors: Equated Scores, Test Items, Test Format, Item Response Theory
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making based on multiple tests in different subjects. The latter is the situation in the system of final examinations for secondary education in the Netherlands, which is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
Liang, Longjuan; Dorans, Neil J.; Sinharay, Sandip – Educational Testing Service, 2009
To ensure fairness, it is important to better understand the relationship of language proficiency with the standard procedures of psychometric analysis. This paper examines how equating results are affected by an increase in the proportion of examinees who report that English is not their first language, using the analysis samples for a…
Descriptors: Equated Scores, English (Second Language), Reading Tests, Mathematics Tests
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity