ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	9

Source

Educational Measurement:…

Author

Dorans, Neil J.	2
Sinharay, Sandip	2
Allalouf, Avi	1
Angoff, William H.	1
Cui, Zhongmin	1
Eignor, Daniel R.	1
Huggins, Anne C.	1
Kim, Stella Y.	1
Kolen, Michael J.	1
Liang, Longjuan	1
Liu, Jinghua	1
Penfield, Randall D.	1
Wainer, Howard	1
Wu, Margaret	1
More ▼

Publication Type

Journal Articles	12
Reports - Descriptive	12
Speeches/Meeting Papers	2
Opinion Papers	1

Education Level

Adult Education	1
Elementary Secondary Education	1
Higher Education	1

Audience

Location

Canada	1
Israel	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
College Board Achievement…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Direct link

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Machine Learning and Small Data

Peer reviewed

Direct link

Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021

Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…

Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

On the Choice of Anchor Tests in Equating

Peer reviewed

Direct link

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

An NCME Instructional Module on Population Invariance in Linking and Equating

Peer reviewed

Direct link

Huggins, Anne C.; Penfield, Randall D. – Educational Measurement: Issues and Practice, 2012

A goal for any linking or equating of two or more tests is that the linking function be invariant to the population used in conducting the linking or equating. Violations of population invariance in linking and equating jeopardize the fairness and validity of test scores, and pose particular problems for test-based accountability programs that…

Descriptors: Equated Scores, Tests, Test Bias, Validity

Assessing a Critical Aspect of Construct Continuity when Test Specifications Change or Test Forms Deviate from Specifications

Peer reviewed

Direct link

Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013

We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…

Descriptors: Tests, Test Construction, Test Format, Change

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

NCME 2007 Presidential Address: The Concordance Table--An Invitation to Misuse Test Scores

Peer reviewed

Direct link

Eignor, Daniel R. – Educational Measurement: Issues and Practice, 2008

This article discusses a particular type of concordance table and the potential for test score misuse that may result from employing such a table. The concordance that is discussed is typically created between scores on different, nonequatable versions of a test that share the same or close to the same test title. These concordance tables often…

Descriptors: Scores, Tables (Data), Comparative Analysis, Equated Scores

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

Linking Assessments Effectively: Purpose and Design.

Peer reviewed

Kolen, Michael J. – Educational Measurement: Issues and Practice, 2001

Discusses some practical issues in linking educational assessments, focusing on the importance of clarity of purpose when assessments are linked. Also stresses the importance of the design used to collect data for linking. Uses linking studies from a variety of situations to illustrate these points. (SLD)

Descriptors: Data Collection, Educational Assessment, Equated Scores, Research Design

An NCME Instructional Module on Quality Control Procedures in the Scoring, Equating, and Reporting of Test Scores

Peer reviewed

Direct link

Allalouf, Avi – Educational Measurement: Issues and Practice, 2007

There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…

Descriptors: Scoring, Quality Control, Sequential Approach, Error Correction

Comparing the Incomparable: An Essay on the Importance of Big Assumptions and Scant Evidence.

Peer reviewed

Wainer, Howard – Educational Measurement: Issues and Practice, 1999

Discusses the comparison of groups of individuals who were administered different forms of a test. Focuses on the situation in which there is little overlap in content between the test forms. Reviews equating problems in national tests in Canada and Israel. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Foreign Countries, National Competency Tests

Some Contributions of the College Board SAT to Psychometric Theory and Practice.

Peer reviewed

Angoff, William H. – Educational Measurement: Issues and Practice, 1986

The author describes the evolution of the Scholastic Aptitude Test (SAT). He describes some of the contributions to educational research made by inquiries based on the SAT and gives particular attention to the way scaling and equating procedures for the SAT have evolved over the last half century. (Author/JAZ)

Descriptors: Achievement Tests, Aptitude Tests, College Entrance Examinations, Educational History

Equated Scores	12
Comparative Analysis	3
Test Construction	3
Test Format	3
Test Items	3
College Entrance Examinations	2
Educational Assessment	2
Evaluation Methods	2
Measurement Techniques	2
Program Effectiveness	2
Psychometrics	2
Quality Control	2
Sampling	2
Test Bias	2
Testing Programs	2
Tests	2
Academic Achievement	1
Accountability	1
Achievement Rating	1
Achievement Tests	1
Aptitude Tests	1
Artificial Intelligence	1
Bayesian Statistics	1
Best Practices	1
Change	1
More ▼