ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	9
Since 2006 (last 20 years)	40

Descriptor

Equated Scores	77
Test Construction	22
Item Response Theory	20
Scaling	17
Test Items	14
Test Format	13
Error of Measurement	12
Testing Programs	12
Foreign Countries	11
Measurement Techniques	10
Psychometrics	10
Achievement Tests	9
Data Collection	9
Scores	9
Statistical Analysis	9
Test Reliability	9
Academic Achievement	8
Evaluation Methods	8
Scoring	8
Test Validity	8
Testing	8
Comparative Analysis	7
Latent Trait Theory	7
College Entrance Examinations	6
State Programs	6
More ▼

Publication Type

Reports - Descriptive	77
Journal Articles	49
Speeches/Meeting Papers	12
Numerical/Quantitative Data	4
Opinion Papers	2
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Research	1
Tests/Questionnaires	1

Education Level

Higher Education	8
Secondary Education	6
Grade 8	4
Early Childhood Education	3
Elementary Education	3
Grade 3	3
Grade 4	3
Grade 5	3
Grade 6	3
Grade 7	3
Intermediate Grades	3
Junior High Schools	3
Middle Schools	3
Postsecondary Education	3
Primary Education	3
Elementary Secondary Education	2
High Schools	2
Adult Education	1
More ▼

Audience

Researchers	4
Practitioners	1
Teachers	1

Location

New York	3
Australia	2
Israel	2
Netherlands	2
Arkansas	1
Canada	1
New Jersey	1
Spain	1
Texas	1
United Kingdom (England)	1
United States	1
Virginia	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

SAT (College Admission Test)	4
ACT Assessment	2
College Board Achievement…	1
Law School Admission Test	1
Measures of Academic Progress	1
National Assessment of…	1
Program for International…	1
Test of Standard Written…	1
Texas Essential Knowledge and…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 77 results Save | Export

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Direct link

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Machine Learning and Small Data

Peer reviewed

Direct link

Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021

Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…

Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

Multiple Group Item Response Theory Applications Using "Stata irt" Package

Peer reviewed

Direct link

Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021

The purpose of this paper is to briefly introduce two most common applications of multiple group item response theory (IRT) models, namely detecting differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…

Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis

English MAP Reading Fluency Technical Report: Based on Assessments Administered during the 2020-2021 School Year

Download full text

NWEA, 2022

This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…

Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency

On the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Section Preequating under the Equivalent Groups Design without IRT

Peer reviewed

Direct link

Guo, Hongwen; Puhan, Gautam – Journal of Educational Measurement, 2014

In this article, we introduce a section preequating (SPE) method (linear and nonlinear) under the randomly equivalent groups design. In this equating design, sections of Test X (a future new form) and another existing Test Y (an old form already on scale) are administered. The sections of Test X are equated to Test Y, after adjusting for the…

Descriptors: Equated Scores, Correlation, Simulation, Testing

Statistical Assessment of Estimated Transformations in Observed-Score Equating

Peer reviewed

Direct link

Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016

Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…

Descriptors: Statistical Analysis, Equated Scores, Scores, Models

Does Testing Date Impact Student Scores on the ACT? Technical Brief

Download full text

Camara, Wayne J.; Allen, Jeff – ACT, Inc., 2017

Students must choose when to take the ACT for the first time and if and when to retest. States and districts that administer the ACT test to all students must also choose when to administer the test. A key consideration in making these decisions is the impact on scores. Because the ACT is a curriculum-based test of academic achievement, students…

Descriptors: Scores, Time Perspective, Scheduling, Testing

User Guide for the 2014-15 Teacher Median Student Growth Percentile Report

Download full text

New Jersey Department of Education, 2016

On March 22, 2016, the New Jersey Department of Education ("the Department") published a broadcast memo sharing secure district access to 2014-15 median Student Growth Percentile (mSGP) data for all qualifying teachers. These data describe student growth from the last school year, and comprise 10% of qualifying teachers' 2014-15…

Descriptors: Achievement Gains, Outcome Measures, Teacher Qualifications, Equated Scores

Equating a Large-Scale Writing Assessment Using Pairwise Comparisons of Performances

Peer reviewed

Direct link

Humphry, Stephen M.; McGrane, Joshua A. – Australian Educational Researcher, 2015

This paper presents a method for equating writing assessments using pairwise comparisons which does not depend upon conventional common-person or common-item equating designs. Pairwise comparisons have been successfully applied in the assessment of open-ended tasks in English and other areas such as visual art and philosophy. In this paper,…

Descriptors: Writing Evaluation, Evaluation Methods, Comparative Analysis, Writing Tests

An NCME Instructional Module on Population Invariance in Linking and Equating

Peer reviewed

Direct link

Huggins, Anne C.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2012

A goal for any linking or equating of two or more tests is that the linking function be invariant to the population used in conducting the linking or equating. Violations of population invariance in linking and equating jeopardize the fairness and validity of test scores, and pose particular problems for test-based accountability programs that…

Descriptors: Equated Scores, Tests, Test Bias, Validity

Assessing a Critical Aspect of Construct Continuity when Test Specifications Change or Test Forms Deviate from Specifications

Peer reviewed

Direct link

Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013

We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…

Descriptors: Tests, Test Construction, Test Format, Change

Fair and Equitable Measurement of Student Learning in MOOCS: An Introduction to Item Response Theory, Scale Linking, and Score Equating

Peer reviewed
PDF on ERIC

Download full text

Meyer, J. Patrick; Zhu, Shi – Research & Practice in Assessment, 2013

Massive open online courses (MOOCs) are playing an increasingly important role in higher education around the world, but despite their popularity, the measurement of student learning in these courses is hampered by cheating and other problems that lead to unfair evaluation of student learning. In this paper, we describe a framework for maintaining…

Descriptors: Online Courses, College Students, Student Evaluation, Learning

Software Note: Using BILOG for Fixed-Anchor Item Calibration

Peer reviewed

Direct link

DeMars, Christine E.; Jurich, Daniel P. – Applied Psychological Measurement, 2012

The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…

Descriptors: Computer Software, Item Response Theory, Scaling, Equated Scores

Equating of Augmented Subscores

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Journal of Educational Measurement, 2011

Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008b) suggested reporting an augmented subscore that is a linear combination of a subscore and the total score. Sinharay and Haberman (2008) and Sinharay (2010) showed that augmented subscores often lead to more accurate…

Descriptors: Diagnostic Tests, Psychometrics, Testing, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Educational Measurement:…	12
Journal of Educational…	6
Applied Psychological…	4
Journal of Educational and…	4
Measurement:…	4
Educational Testing Service	3
New York State Education…	3
ACT, Inc.	2
Applied Measurement in…	2
Evaluation and the Health…	2
Journal of Applied Measurement	2
Studies in Educational…	2
Advances in Health Sciences…	1
American Institutes for…	1
Assessment in Education…	1
Assessment in Education:…	1
Australian Educational…	1
College Entrance Examination…	1
European Journal of…	1
Journal of Mixed Methods…	1
Journal of School Psychology	1
Mathematics Teacher	1
NWEA	1
New Jersey Department of…	1
Popular Measurement	1
More ▼

Dorans, Neil J.	4
Eignor, Daniel R.	3
Sinharay, Sandip	3
Cook, Linda L.	2
Haberman, Shelby J.	2
Hanson, Bradley A.	2
Huynh, Huynh	2
Lissitz, Robert W.	2
Livingston, Samuel A.	2
Ogasawara, Haruhiko	2
Penfield, Randall D.	2
van der Linden, Wim J.	2
von Davier, Alina A.	2
Allalouf, Avi	1
Allen, Jeff	1
Anderson, A. E.	1
Angoff, William H.	1
Antal, Judit	1
Baghi, Heibatollah	1
Beaton, Albert E.	1
Beguin, A. A.	1
Beretvas, S. Natasha	1
Brennan, Robert L.	1
Camara, Wayne J.	1
Cui, Zhongmin	1
More ▼