Showing all 11 results
Peer reviewed
Casabianca, Jodi M. – Educational Measurement: Issues and Practice, 2021
Module Overview: In this digital ITEMS module, Dr. Jodi M. Casabianca provides a primer on the "hierarchical rater model" (HRM) framework and the recent expansions to the model for analyzing raters and ratings of constructed responses. In the first part of the module, she establishes an understanding of the nature of constructed…
Descriptors: Hierarchical Linear Modeling, Rating Scales, Error of Measurement, Item Response Theory
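For readers new to the topic, a commonly cited two-level formulation of the HRM (the parameterization below is drawn from the general HRM literature and is an assumption here, not necessarily the exact version presented in the module) treats each observed rating as a noisy, possibly biased report of an "ideal" item score:

```latex
% Level 1: rater r's rating X_{ijr} of examinee i on item j, given the
% ideal rating \xi_{ij}, with rater severity \phi_r and variability \psi_r
P(X_{ijr} = k \mid \xi_{ij} = \eta) \;\propto\;
  \exp\!\left\{-\frac{\bigl[k - (\eta + \phi_r)\bigr]^2}{2\psi_r^2}\right\}

% Level 2: ideal ratings follow a polytomous IRT model (e.g., a partial
% credit model) given proficiency \theta_i and item parameters \beta_j
\xi_{ij} \mid \theta_i \;\sim\; \mathrm{PCM}(\theta_i, \boldsymbol{\beta}_j)
```

Keeping rater parameters separate from the IRT layer is what lets the framework characterize individual raters as well as the ratings they produce.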
Peer reviewed
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Peer reviewed
Lottridge, Sue; Burkhardt, Amy; Boyer, Michelle – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Sue Lottridge, Amy Burkhardt, and Dr. Michelle Boyer provide an overview of automated scoring. Automated scoring is the use of computer algorithms to score unconstrained open-ended test items by mimicking human scoring. The use of automated scoring is increasing in educational assessment programs because it allows…
Descriptors: Computer Assisted Testing, Scoring, Automation, Educational Assessment
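As a deliberately simplified illustration of "mimicking human scoring," the sketch below fits a regression model to hand-scored responses and predicts scores for new ones; the toy data, features, and scikit-learn pipeline are illustrative assumptions, not a description of any operational engine covered in the module.

```python
# Minimal automated-scoring sketch: learn to reproduce human scores
# from text features of constructed responses (illustrative only).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

# Hand-scored training responses (hypothetical toy data, 0-3 rubric).
responses = [
    "Photosynthesis converts light energy into chemical energy in plants.",
    "Plants eat sunlight.",
    "Chlorophyll lets the plant turn CO2 and water into glucose and oxygen.",
    "I don't know.",
]
human_scores = [3, 1, 3, 0]

# Featurize the text and fit a model that imitates the human raters.
scorer = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), Ridge(alpha=1.0))
scorer.fit(responses, human_scores)

# Machine score for a new response; real programs would also check
# human-machine agreement on a held-out set before deployment.
new_response = ["The plant uses chlorophyll and light to make glucose."]
print(round(float(scorer.predict(new_response)[0]), 2))
```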
Peer reviewed
Luecht, Richard; Ackerman, Terry A. – Educational Measurement: Issues and Practice, 2018
Simulation studies are extremely common in the item response theory (IRT) research literature. This article presents a didactic discussion of "truth" and "error" in IRT-based simulation studies. We ultimately recommend that future research focus less on the simple recovery of parameters from a convenient generating IRT model,…
Descriptors: Item Response Theory, Simulation, Ethics, Error of Measurement
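The kind of parameter-recovery study the authors scrutinize can be sketched in a few lines: data are generated from a "true" 2PL model, and the question is how well estimates recover the generating values. The model, sample sizes, and distributions below are arbitrary illustrations, not designs analyzed in the article.

```python
# Sketch of a conventional IRT parameter-recovery simulation:
# "truth" is defined entirely by the generating 2PL model.
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items = 2000, 30

theta = rng.normal(0, 1, n_persons)      # true abilities
a = rng.lognormal(0, 0.3, n_items)       # true discriminations
b = rng.normal(0, 1, n_items)            # true difficulties

# Generate dichotomous responses from the 2PL model (person x item).
logits = a * (theta[:, None] - b)
p = 1 / (1 + np.exp(-logits))
responses = rng.binomial(1, p)

# A recovery study would now fit an IRT model to `responses` and compare
# estimates with (a, b, theta) via bias and RMSE. The article's caution is
# that clean recovery under a convenient generating model says little about
# error when real data do not follow that "truth".
print(responses.shape, round(responses.mean(), 3))
```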
Peer reviewed
Lane, David; Oswald, Frederick L. – Educational Measurement: Issues and Practice, 2016
The educational literature, the popular press, and educated laypeople have all echoed a conclusion from the book "Academically Adrift" by Richard Arum and Josipa Roksa (which has now become received wisdom), namely, that 45% of college students showed no significant gains in critical thinking skills. Similar results were reported by…
Descriptors: College Students, Critical Thinking, Thinking Skills, Statistical Analysis
Peer reviewed
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
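For context on the conditional standard errors of measurement mentioned in the abstract, one standard IRT-based definition (an illustration, not necessarily the machinery used in the paper) lets the error of an ability estimate, and of any scale score derived from it, vary along the score scale:

```latex
% Test information at proficiency \theta, summed over the J items
I(\theta) = \sum_{j=1}^{J} I_j(\theta),
\qquad
\mathrm{CSEM}(\hat{\theta} \mid \theta) \approx \frac{1}{\sqrt{I(\theta)}}

% For a reported scale score s = f(\hat{\theta}), the delta method gives
\mathrm{CSEM}(s \mid \theta) \approx \frac{\lvert f'(\theta) \rvert}{\sqrt{I(\theta)}}
```

Because test information differs across the ability range, the error band attached to a reported score is not constant, which bears directly on how mixed-format scores are used and interpreted.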
Peer reviewed
Tong, Ye; Kolen, Michael J. – Educational Measurement: Issues and Practice, 2010
"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…
Descriptors: Test Results, Scaling, Measures (Individuals), Raw Scores
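A minimal example of the raw-to-scale-score step this module describes is a linear transformation chosen to hit a target reported mean and standard deviation; the numbers below are made up for illustration.

```latex
% Linear raw-to-scale transformation with target scale mean \mu_s and SD \sigma_s
s(x) = \mu_s + \frac{\sigma_s}{\sigma_x}\,(x - \mu_x)

% With assumed raw mean \mu_x = 32, raw SD \sigma_x = 8, and targets
% \mu_s = 500, \sigma_s = 100, a raw score of 40 maps to
s(40) = 500 + \frac{100}{8}(40 - 32) = 600
```

Operational score scales typically add rounding and truncation to the reporting range on top of a transformation like this.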
Peer reviewed
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
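Assigning a different form to repeat examinees only yields comparable scores if the forms are placed on a common scale. A simple linear equating of a new form X to a base form Y, shown here purely as background (the article may examine a different design), is:

```latex
% Linear equating of a score x on form X to the reporting scale of form Y,
% using means and SDs from (randomly equivalent) groups taking each form
e_Y(x) = \mu_Y + \frac{\sigma_Y}{\sigma_X}\,(x - \mu_X)
```

Without some such adjustment, a repeat examinee's score change confounds true change with differences in form difficulty.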
Peer reviewed
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
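One way to make the point about multiple error sources concrete, under simplifying independence assumptions that are mine rather than the author's, is to write the error variance of a reported group statistic, say a cohort mean, as a sum of components:

```latex
% Illustrative decomposition of the error variance of an estimated cohort mean
\operatorname{Var}(\hat{\mu}) \;\approx\;
  \underbrace{\frac{\sigma^2_{\text{between}}}{n}}_{\text{student sampling}}
  + \underbrace{\frac{\bar{\sigma}^2_{\text{meas}}}{n}}_{\text{measurement error}}
  + \underbrace{\sigma^2_{\text{link}}}_{\text{equating/linking error}}
```

The first two components shrink as the sample size n grows while linking error does not, so the relative magnitude of the different sources depends on how results are aggregated and reported.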
Peer reviewed
Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008
Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…
Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring
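The three rater-assignment schemes named in the abstract differ only in how readers are mapped to one student's responses. The toy allocation below is meant to make that mapping visible; the reader pool, item count, and block boundaries are invented, not taken from the study.

```python
# Toy illustration of three ways to assign readers to one student's
# constructed responses (12 items, reader pool R1..R6).
import random

items = [f"item_{i + 1:02d}" for i in range(12)]
readers = [f"R{r}" for r in range(1, 7)]
random.seed(0)

# (1) Single-reader scoring: one reader scores every response.
single = {item: readers[0] for item in items}

# (2) Item-specific scoring: a (potentially) different reader per response.
item_specific = {item: random.choice(readers) for item in items}

# (3) Rater item blocks (RIBs): three readers each score roughly
#     one-third of the responses.
rib_readers = readers[:3]
rib = {item: rib_readers[i * 3 // len(items)] for i, item in enumerate(items)}

for name, design in [("single", single), ("item-specific", item_specific), ("RIB", rib)]:
    print(f"{name:>13}: {sorted(set(design.values()))}")
```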
Peer reviewed
Cangelosi, James S. – Educational Measurement: Issues and Practice, 1984
Test development procedures and six methods for determining cut-off scores are briefly described. An alternate method, appropriate when the test developer also determines the cut-off score, is suggested. Unlike other methods, the standard is set during the test development stage. Its computations are intelligible to nonstatistically-oriented…
Descriptors: Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education, Error of Measurement