ERIC - Search Results

Publication Date

In 2025	4
Since 2024	9
Since 2021 (last 5 years)	58
Since 2016 (last 10 years)	147
Since 2006 (last 20 years)	496

Descriptor

Equated Scores	1113
Test Items	298
Item Response Theory	297
Comparative Analysis	247
Statistical Analysis	233
Test Construction	165
Error of Measurement	143
Test Format	135
Scaling	129
College Entrance Examinations	124
Difficulty Level	119
Scores	117
Achievement Tests	116
Latent Trait Theory	113
Standardized Tests	113
Item Analysis	111
Sample Size	110
Mathematical Models	106
Evaluation Methods	102
Scoring	102
Testing Problems	98
Reading Tests	97
Test Reliability	97
Simulation	95
Raw Scores	94
More ▼

Author

Bianchini, John C.	35
von Davier, Alina A.	34
Dorans, Neil J.	33
Kolen, Michael J.	31
Loret, Peter G.	31
Kim, Sooyeon	26
Moses, Tim	24
Livingston, Samuel A.	22
Holland, Paul W.	20
Puhan, Gautam	20
Liu, Jinghua	19
Hanson, Bradley A.	17
van der Linden, Wim J.	16
Sinharay, Sandip	15
Walker, Michael E.	13
Angoff, William H.	12
Brennan, Robert L.	12
Cook, Linda L.	12
Eignor, Daniel R.	12
Lee, Won-Chan	12
Linn, Robert L.	12
Guo, Hongwen	11
Haberman, Shelby J.	11
Harris, Deborah J.	10
More ▼

Education Level

Higher Education	68
Postsecondary Education	50
Secondary Education	47
Elementary Education	35
Elementary Secondary Education	34
High Schools	26
Middle Schools	22
Junior High Schools	19
Grade 8	18
Grade 4	11
Grade 7	10
Intermediate Grades	10
Grade 6	9
Grade 3	8
Early Childhood Education	7
Grade 5	6
Adult Education	5
Primary Education	5
Grade 1	3
Grade 9	3
Adult Basic Education	2
Grade 10	2
Grade 11	2
Grade 2	2
High School Equivalency…	2
More ▼

Audience

Researchers	45
Practitioners	7
Administrators	1
Policymakers	1
Students	1
Teachers	1

Location

Canada	9
Australia	8
Florida	8
United Kingdom (England)	8
Netherlands	7
New York	7
United States	7
Israel	6
Turkey	6
United Kingdom	6
California	5
Japan	4
Sweden	4
Texas	4
Delaware	3
Georgia	3
New Jersey	3
Oregon	3
United Kingdom (Wales)	3
Hungary	2
Indonesia	2
Italy	2
Michigan	2
North Carolina	2
Saudi Arabia	2
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	12
No Child Left Behind Act 2001	5
Education Consolidation…	3
Hawkins Stafford Act 1988	1
Race to the Top	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Equated Scores X

Showing 151 to 165 of 1,113 results Save | Export

An Examination of Two Procedures for Identifying Consequential Item Parameter Drift

Peer reviewed

Direct link

Wells, Craig S.; Hambleton, Ronald K.; Kirkpatrick, Robert; Meng, Yu – Applied Measurement in Education, 2014

The purpose of the present study was to develop and evaluate two procedures flagging consequential item parameter drift (IPD) in an operational testing program. The first procedure was based on flagging items that exhibit a meaningful magnitude of IPD using a critical value that was defined to represent barely tolerable IPD. The second procedure…

Descriptors: Test Items, Test Bias, Equated Scores, Item Response Theory

Evaluating Common Item Block Options When Faced with Practical Constraints

Peer reviewed
PDF on ERIC

Download full text

Wolkowitz, Amanda; Davis-Becker, Susan – Practical Assessment, Research & Evaluation, 2015

This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…

Descriptors: Test Items, Equated Scores, Licensing Examinations (Professions), Test Content

Adapting Educational Measurement to the Demands of Test-Based Accountability

Peer reviewed

Direct link

Koretz, Daniel – Measurement: Interdisciplinary Research and Perspectives, 2015

Accountability has become a primary function of large-scale testing in the United States. The pressure on educators to raise scores is vastly greater than it was several decades ago. Research has shown that high-stakes testing can generate behavioral responses that inflate scores, often severely. I argue that because of these responses, using…

Descriptors: Accountability, Educational Testing, Test Construction, Test Validity

Equating without an Anchor for Nonequivalent Groups of Examinees

Peer reviewed

Direct link

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015

An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…

Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring

Psychometric Consequences of Subpopulation Item Parameter Drift

Peer reviewed

Direct link

Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…

Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing

A Comparison of Raw-to-Scale Conversion Consistency between Single- and Multiple-Linking Using a Nonequivalent Groups Anchor Test Design. Research Report. ETS RR-14-13

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2014

Maintaining score interchangeability and scale consistency is crucial for any testing programs that administer multiple forms across years. The use of a multiple linking design, which involves equating a new form to multiple old forms and averaging the conversions, has been proposed to control scale drift. However, the use of multiple linking…

Descriptors: Comparative Analysis, Reliability, Test Construction, Equated Scores

Local Observed-Score Kernel Equating

Peer reviewed

Direct link

Wiberg, Marie; van der Linden, Wim J.; von Davier, Alina A. – Journal of Educational Measurement, 2014

Three local observed-score kernel equating methods that integrate methods from the local equating and kernel equating frameworks are proposed. The new methods were compared with their earlier counterparts with respect to such measures as bias--as defined by Lord's criterion of equity--and percent relative error. The local kernel item response…

Descriptors: Measurement Techniques, Evaluation Methods, Item Response Theory, Equated Scores

Evaluating Equating Accuracy and Assumptions for Groups that Differ in Performance

Peer reviewed

Direct link

Powers, Sonya; Kolen, Michael J. – Journal of Educational Measurement, 2014

Accurate equating results are essential when comparing examinee scores across exam forms. Previous research indicates that equating results may not be accurate when group differences are large. This study compared the equating results of frequency estimation, chained equipercentile, item response theory (IRT) true-score, and IRT observed-score…

Descriptors: Accuracy, Equated Scores, Differences, Groups

The Ability of Non-Music Majors to Self-Evaluate at the End of a Music Course

Peer reviewed
PDF on ERIC

Download full text

Keast, Dan; Tapper, Larke – Journal of Educators Online, 2016

The researchers of this study investigated the participants' (N = 177) use of a self-evaluation tool employed at the end of an online undergraduate music course that fulfilled the Texas general education requirement for the creative arts. Participants' use of the two aspects of the tool correlated at r = 0.5548--interpreted as a high positive…

Descriptors: Music Education, Self Evaluation (Individuals), Majors (Students), Online Courses

Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

Peer reviewed

Direct link

Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores

Applying the Nominal Response Model within a Longitudinal Framework to Construct the Positive Family Relationships Scale

Peer reviewed

Direct link

Preston, Kathleen Suzanne Johnson; Parral, Skye N.; Gottfried, Allen W.; Oliver, Pamella H.; Gottfried, Adele Eskeles; Ibrahim, Sirena M.; Delany, Danielle – Educational and Psychological Measurement, 2015

A psychometric analysis was conducted using the nominal response model under the item response theory framework to construct the Positive Family Relationships scale. Using data from the Fullerton Longitudinal Study, this scale was constructed within a long-term longitudinal framework spanning middle childhood through adolescence. Items tapping…

Descriptors: Family Relationship, Measures (Individuals), Psychometrics, Models

Effect of Adjusting Pseudo-Guessing Parameter Estimates on Test Scaling When Item Parameter Drift Is Present

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T.; Wells, Craig S.; Hambleton, Ronald K. – Practical Assessment, Research & Evaluation, 2015

In item response theory test scaling/equating with the three-parameter model, the scaling coefficients A and B have no impact on the c-parameter estimates of the test items since the cparameter estimates are not adjusted in the scaling/equating procedure. The main research question in this study concerned how serious the consequences would be if…

Descriptors: Item Response Theory, Monte Carlo Methods, Scaling, Test Items

Explaining Variation in Findings from Efficacy and Effectiveness Studies for English Reading Interventions for English Learners

Peer reviewed

Direct link

Barr, Christopher D.; Reutebuch, Colleen K.; Carlson, Coleen D.; Vaughn, Sharon; Francis, David J. – Journal of Research on Educational Effectiveness, 2019

Beginning in 2002, researchers developed, implemented, and evaluated the efficacy of an English reading intervention for first-grade English learners using multiple randomized control trials (RCTs). As a result of this efficacy work, researchers successfully competed for an IES Goal 4 effectiveness study using the same intervention. Unlike the…

Descriptors: Intervention, English Language Learners, Grade 1, Elementary School Students

New York Charter Schools Outperform Traditional Selective Public Schools: More Evidence That Cream-Skimming Is Not Driving Charters' Success. Report 33

Download full text

Winters, Marcus A. – Manhattan Institute for Policy Research, 2017

Critics of charter schools in New York City, America's largest school district, often allege that charters score better on standardized tests, on average, than traditional public schools because charters "cream-skim" (i.e., attract) the brightest, most motivated, students. Yet this accusation neglects the fact that not all traditional…

Descriptors: Charter Schools, Public Schools, School Effectiveness, Success

What's in a Grade? Grading Policies and Practices in Principles of Economics

Peer reviewed

Direct link

Walstad, William B.; Miller, Laurie A. – Journal of Economic Education, 2016

Survey results from a national sample of economics instructors describe the grading policies and practices in principles of economics courses. The survey results provide insights about absolute and relative grading systems used by instructors, the course components and their weights that determine grades, and the type of assessment items used for…

Descriptors: Grades (Scholastic), Grading, Economics Education, Educational Policy

« Previous Page | Next Page »

Pages: 1 | ... | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ... | 75

Journal of Educational…	108
ETS Research Report Series	78
Applied Psychological…	69
Applied Measurement in…	55
Educational and Psychological…	43
Measurement:…	26
Educational Measurement:…	25
ProQuest LLC	25
Educational Testing Service	23
Journal of Educational and…	17
International Journal of…	13
Journal of Educational…	13
Practical Assessment,…	10
Psychometrika	10
College Board	8
ACT, Inc.	6
Educational Assessment	6
Journal of Experimental…	6
Online Submission	6
Studies in Educational…	6
College Entrance Examination…	5
Journal of Applied Measurement	5
Assessment in Education:…	4
International Journal of…	4
International Journal of…	4
More ▼

Journal Articles	596
Reports - Research	587
Reports - Evaluative	284
Speeches/Meeting Papers	201
Numerical/Quantitative Data	82
Reports - Descriptive	77
Opinion Papers	33
Dissertations/Theses -…	27
Information Analyses	24
Guides - Non-Classroom	15
Tests/Questionnaires	10
Collected Works - General	7
Guides - General	5
Reports - General	5
Books	4
Collected Works - Proceedings	4
Collected Works - Serials	3
Reference Materials -…	3
Book/Product Reviews	2
Guides - Classroom - Learner	2
Dissertations/Theses	1
Guides - Classroom - Teacher	1
Historical Materials	1
Legal/Legislative/Regulatory…	1
Non-Print Media	1
More ▼

SAT (College Admission Test)	73
Iowa Tests of Basic Skills	48
California Achievement Tests	43
Comprehensive Tests of Basic…	43
Metropolitan Achievement Tests	37
Sequential Tests of…	37
Stanford Achievement Tests	37
SRA Achievement Series	35
National Assessment of…	23
Graduate Record Examinations	20
ACT Assessment	18
Advanced Placement…	15
Law School Admission Test	13
Armed Services Vocational…	11
Gates MacGinitie Reading Tests	10
Test of English as a Foreign…	9
Program for International…	8
Preliminary Scholastic…	7
College Board Achievement…	6
Trends in International…	6
General Aptitude Test Battery	5
General Educational…	5
Graduate Management Admission…	5
National Merit Scholarship…	5
Wechsler Intelligence Scale…	5
More ▼