ERIC - Search Results

Educational Measurement: Issues and Practice

Showing 1 to 15 of 25 results

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Evaluating Population Invariance of Test Equating during the COVID-19 Pandemic

Peer reviewed

Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022

Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…

Descriptors: Test Items, Equated Scores, COVID-19, Pandemics

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Machine Learning and Small Data

Peer reviewed

Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021

Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…

Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

On the Choice of Anchor Tests in Equating

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

The Philosophical Aspects of IRT Equating: Modeling Drift to Evaluate Cohort Growth in Large-Scale Assessments

Peer reviewed

Taherbhai, Husein; Seo, Daeryong – Educational Measurement: Issues and Practice, 2013

Calibration and equating is the quintessential necessity for most large-scale educational assessments. However, there are instances when no consideration is given to the equating process in terms of context and substantive realization, and the methods used in its execution. In the view of the authors, equating is not merely an exhibit of the…

Descriptors: Item Response Theory, Equated Scores, Measurement, Educational Assessment

An NCME Instructional Module on Population Invariance in Linking and Equating

Peer reviewed

Huggins, Anne C.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2012

A goal for any linking or equating of two or more tests is that the linking function be invariant to the population used in conducting the linking or equating. Violations of population invariance in linking and equating jeopardize the fairness and validity of test scores, and pose particular problems for test-based accountability programs that…

Descriptors: Equated Scores, Tests, Test Bias, Validity

Quantifying Error and Uncertainty Reductions in Scaling Functions: An ITEMS Module

Peer reviewed

Moses, Tim – Educational Measurement: Issues and Practice, 2014

This module describes and extends X-to-Y regression measures that have been proposed for use in the assessment of X-to-Y scaling and equating results. Measures are developed that are similar to those based on prediction error in regression analyses but that are directly suited to interests in scaling and equating evaluations. The regression and…

Descriptors: Scaling, Regression (Statistics), Equated Scores, Comparative Analysis

Assessing a Critical Aspect of Construct Continuity when Test Specifications Change or Test Forms Deviate from Specifications

Peer reviewed

Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013

We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…

Descriptors: Tests, Test Construction, Test Format, Change

Equating Subscores under the Nonequivalent Anchor Test (NEAT) Design

Peer reviewed

Puhan, Gautam; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

The study examined two approaches for equating subscores. They are (1) equating subscores using internal common items as the anchor to conduct the equating, and (2) equating subscores using equated and scaled total scores as the anchor to conduct the equating. Since equated total scores are comparable across the new and old forms, they can be used…

Descriptors: Equated Scores, Test Items, Methods

Comments on Neil Dorans's NCME Career Award Address: The Contestant Perspective on Taking Tests--Emanations from the Statue within

Peer reviewed

Mislevy, Robert J. – Educational Measurement: Issues and Practice, 2012

This article presents the author's observations on Neil Dorans's NCME Career Award Address: "The Contestant Perspective on Taking Tests: Emanations from the Statue within." He calls attention to some points that Dr. Dorans made in his address, and offers his thoughts in response.

Descriptors: Testing, Test Reliability, Psychometrics, Scores

The Contestant Perspective on Taking Tests: Emanations from the Statue within

Peer reviewed

Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012

Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…

Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

NCME 2007 Presidential Address: The Concordance Table--An Invitation to Misuse Test Scores

Peer reviewed

Eignor, Daniel R. – Educational Measurement: Issues and Practice, 2008

This article discusses a particular type of concordance table and the potential for test score misuse that may result from employing such a table. The concordance that is discussed is typically created between scores on different, nonequatable versions of a test that share the same or close to the same test title. These concordance tables often…

Descriptors: Scores, Tables (Data), Comparative Analysis, Equated Scores

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
