ERIC - Search Results

Showing 1 to 15 of 44 results

Digital Module 32: Understanding and Mitigating the Impact of Low Effort on Common Uses of Test and Survey Scores

Peer reviewed

Soland, James – Educational Measurement: Issues and Practice, 2023

Most individuals who take, interpret, design, or score tests are aware that examinees do not always provide full effort when responding to items. However, many such individuals are not aware of how pervasive the issue is, what its consequences are, and how to address it. In this digital ITEMS module, Dr. James Soland will help fill these gaps in…

Descriptors: Student Behavior, Tests, Scores, Incidence

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Digital Module 14: Planning and Conducting Standard Setting

Peer reviewed

Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…

Descriptors: Standard Setting, Cutting Scores, Scores, Reports

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

Machine Learning and Small Data

Peer reviewed

Cui, Zhongmin – Educational Measurement: Issues and Practice, 2021

Commonly used machine learning applications seem to relate to big data. This article provides a gentle review of machine learning and shows why machine learning can be applied to small data too. An example of applying machine learning to screen irregularity reports is presented. In the example, the support vector machine and multinomial naïve…

Descriptors: Artificial Intelligence, Man Machine Systems, Data, Bayesian Statistics

Standardization and "UNDERSTAND"ardization in Educational Assessment

Peer reviewed

Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2020

Educational tests are standardized so that all examinees are tested on the same material, under the same testing conditions, and with the same scoring protocols. This uniformity is designed to provide a level "playing field" for all examinees so that the test is "the same" for everyone. Thus, standardization is designed to…

Descriptors: Standards, Educational Assessment, Culture Fair Tests, Scoring

Digital Module 13: Monte Carlo Simulation Studies in Item Response Theory

Peer reviewed

Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…

Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items

Digital Module 07: Subscores--Evaluation and Reporting https://ncme.elevate.commpartners.com

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2019

Test score users often demand the reporting of subscores due to their potential diagnostic, remedial, and instructional benefits. Therefore, there is substantial pressure on testing programs to report subscores. However, professional standards require that subscores have to satisfy minimum quality standards before they can be reported. In this…

Descriptors: Testing, Scores, Item Response Theory, Evaluation Methods

How Can Released State Test Items Support Interim Assessment Purposes in an Educational Crisis?

Peer reviewed

Klugman, Emma M.; Ho, Andrew D. – Educational Measurement: Issues and Practice, 2020

State testing programs regularly release previously administered test items to the public. We provide an open-source recipe for state, district, and school assessment coordinators to combine these items flexibly to produce scores linked to established state score scales. These would enable estimation of student score distributions and achievement…

Descriptors: Testing Programs, State Programs, Test Items, Scores

On the Choice of Anchor Tests in Equating

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Too Simple to Be Useful: A Comment on Feinberg and Wainer (2014)

Peer reviewed

Sinharay, Sandip; Haberman, Shelby; Boughton, Keith – Educational Measurement: Issues and Practice, 2015

Feinberg and Wainer (2014) provided a simple equation to approximate/predict a subscore's value. The purpose of this note is to point out that their equation is often inaccurate in that it does not always predict a subscore's value correctly. Therefore, the utility of their simple equation is not clear.

Descriptors: Equations (Mathematics), Scores, Prediction, Accuracy

A Tale of Two Tests (and of Two Examinees)

Peer reviewed

Clauser, Amanda L.; Wainer, Howard – Educational Measurement: Issues and Practice, 2016

It is widely accepted dogma that consequential decisions are better made with multiple measures, because using but a single one is thought more likely to be laden with biases and errors that can be better controlled with a wider source of evidence for making judgments. Unfortunately, advocates of using multiple measures too rarely provide detailed…

Descriptors: Tests, Examiners, College Entrance Examinations, Measurement

A Simple Equation to Predict a Subscore's Value

Peer reviewed

Feinberg, Richard A.; Wainer, Howard – Educational Measurement: Issues and Practice, 2014

Subscores are often used to indicate test-takers' relative strengths and weaknesses and so help focus remediation. But a subscore is not worth reporting if it is too unreliable to believe or if it contains no information that is not already contained in the total score. It is possible, through the use of a simple linear equation provided in…

Descriptors: Scores, Equations (Mathematics), Prediction, Reliability

Do 45% of College Students Lack Critical Thinking Skills? Revisiting a Central Conclusion of "Academically Adrift"

Peer reviewed

Lane, David; Oswald, Frederick L. – Educational Measurement: Issues and Practice, 2016

The educational literature, the popular press, and educated laypeople have all echoed a conclusion from the book "Academically Adrift" by Richard Arum and Josipa Roksa (which has now become received wisdom), namely, that 45% of college students showed no significant gains in critical thinking skills. Similar results were reported by…

Descriptors: College Students, Critical Thinking, Thinking Skills, Statistical Analysis

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
