Publication Date
In 2025 | 1
Since 2024 | 1
Since 2021 (last 5 years) | 4
Since 2016 (last 10 years) | 8

Source
Educational Measurement: Issues and Practice | 8

Publication Type
Journal Articles | 8
Reports - Research | 6
Reports - Descriptive | 1
Reports - Evaluative | 1
Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025
Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…
Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods
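For readers unfamiliar with the distinction drawn in the entry above, here is a minimal sketch of how targeted versus random selection of responses for a second rating might work. The rubric range, cut score, budget, and selection rule are illustrative assumptions, not the procedures evaluated in the study.

```python
import random

# Hypothetical first-rater scores for 20 constructed responses on a 0-4 rubric.
random.seed(1)
first_scores = [random.randint(0, 4) for _ in range(20)]

CUT = 2      # illustrative proficiency cut score
BUDGET = 6   # number of responses we can afford to double-score

def targeted_selection(scores, cut, budget):
    """Pick the responses whose first score falls closest to the cut score."""
    ranked = sorted(range(len(scores)), key=lambda i: abs(scores[i] - cut))
    return sorted(ranked[:budget])

def random_selection(scores, budget):
    """Pick responses uniformly at random, ignoring the first score."""
    return sorted(random.sample(range(len(scores)), budget))

print("first-rater scores:", first_scores)
print("targeted double-scoring picks:", targeted_selection(first_scores, CUT, BUDGET))
print("random double-scoring picks:  ", random_selection(first_scores, BUDGET))
```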
Burkhardt, Amy; Lottridge, Susan; Woolf, Sherri – Educational Measurement: Issues and Practice, 2021
For some students, standardized tests serve as a conduit to disclose sensitive issues of harm or distress that may otherwise go unreported. By detecting this writing, known as "crisis papers," testing programs have a unique opportunity to assist in mitigating the risk of harm to these students. The use of machine learning to…
Descriptors: Scoring Rubrics, Identification, At Risk Students, Standardized Tests
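As a toy illustration of the general idea of machine-learning-based flagging, the sketch below routes essays to human review with a generic TF-IDF plus logistic-regression classifier. The training phrases, model choice, and threshold are invented here and do not describe the system discussed in the article.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Tiny fabricated training set: 1 = possible crisis paper, 0 = routine response.
train_texts = [
    "i love my science class and my project went well",
    "the book we read this year was my favorite",
    "i do not feel safe at home and i am scared",
    "i have been thinking about hurting myself",
]
train_labels = [0, 0, 1, 1]

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
model.fit(train_texts, train_labels)

new_essay = "sometimes i am scared to go home after school"
risk = model.predict_proba([new_essay])[0][1]
print(f"estimated crisis probability: {risk:.2f}")
if risk > 0.5:  # illustrative threshold; operational programs tune this for high recall
    print("flag for immediate human review")
```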
Solano-Flores, Guillermo – Educational Measurement: Issues and Practice, 2021
This article proposes a Boolean approach to representing and analyzing interobserver agreement in dichotomous coding. Building on the notion that observations are samples of a universe of observations, it submits that coding can be viewed as a process in which observers sample pieces of evidence on constructs. It distinguishes between formal and…
Descriptors: Online Searching, Coding, Interrater Reliability, Evidence
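As a point of reference for interobserver agreement in dichotomous coding, here is a short sketch computing raw agreement and Cohen's kappa from made-up codes. The codes are fabricated, and the article's Boolean representation itself is not reproduced here.

```python
# Two observers' dichotomous (0/1) codes for the same ten pieces of evidence.
coder_a = [1, 1, 0, 1, 0, 0, 1, 1, 0, 1]
coder_b = [1, 0, 0, 1, 0, 1, 1, 1, 0, 1]

n = len(coder_a)
observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n  # raw agreement

p_a1 = sum(coder_a) / n   # proportion of 1s assigned by coder A
p_b1 = sum(coder_b) / n   # proportion of 1s assigned by coder B
expected = p_a1 * p_b1 + (1 - p_a1) * (1 - p_b1)  # chance agreement

kappa = (observed - expected) / (1 - expected)
print(f"raw agreement = {observed:.2f}, Cohen's kappa = {kappa:.2f}")
```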
Babcock, Ben; Risk, Nicole M.; Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020
This study compared the statistical properties of four job analysis task survey response scale types: criticality, difficulty in learning, importance, and frequency. We used nine job analysis studies spanning two fields, medical imaging and allied health professionals, to compare the job analysis scales in terms of variability and interrater…
Descriptors: Job Analysis, Radiology, Allied Health Personnel, Surveys
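A minimal sketch of comparing two rating-scale types on variability and a simple interrater-consistency index (mean pairwise correlation across raters). The data, scale labels, and indices below are invented for illustration and are not the analyses reported in the study.

```python
from itertools import combinations
from statistics import pstdev, mean

def pearson(x, y):
    """Pearson correlation between two equal-length rating vectors."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

# rows = raters, columns = job tasks; ratings on a 1-5 scale
importance = [[5, 4, 2, 5, 3, 1], [5, 3, 2, 4, 3, 1], [4, 4, 1, 5, 2, 2]]
frequency  = [[3, 3, 2, 3, 3, 2], [4, 2, 3, 3, 2, 3], [2, 3, 3, 2, 3, 2]]

for label, ratings in [("importance", importance), ("frequency", frequency)]:
    spread = mean(pstdev(task) for task in zip(*ratings))              # rating variability
    consistency = mean(pearson(a, b) for a, b in combinations(ratings, 2))
    print(f"{label:<10} mean task SD = {spread:.2f}, mean pairwise r = {consistency:.2f}")
```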
Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…
Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes
Flake, Jessica Kay; Petway, Kevin Terrance, II – Educational Measurement: Issues and Practice, 2019
Numerous studies merely note divergence in students' and teachers' ratings of student noncognitive constructs. However, given the increased attention and use of these constructs in educational research and practice, an in-depth study focused on this issue was needed. Using a variety of quantitative methodologies, we thoroughly investigate…
Descriptors: Teachers, Students, Achievement Rating, Interrater Reliability
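One simple way to quantify such divergence is a mean difference and a correlation between the two rating sources, as sketched below. The ratings are fabricated, and the study's methodology goes well beyond this.

```python
from statistics import mean

student_self = [4, 3, 5, 2, 4, 3, 5, 4]    # e.g., self-rated conscientiousness, 1-5
teacher_rating = [3, 3, 4, 2, 3, 2, 4, 4]  # teacher ratings of the same students

# Average signed difference (positive = students rate themselves higher).
diff = mean(s - t for s, t in zip(student_self, teacher_rating))

# Pearson correlation between the two sources, computed from scratch.
ms, mt = mean(student_self), mean(teacher_rating)
num = sum((s - ms) * (t - mt) for s, t in zip(student_self, teacher_rating))
den = (sum((s - ms) ** 2 for s in student_self) *
       sum((t - mt) ** 2 for t in teacher_rating)) ** 0.5
r = num / den

print(f"mean (student - teacher) difference = {diff:.2f}, correlation = {r:.2f}")
```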
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
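Score resolution is commonly triggered by a discrepancy rule between two raters' scores, as in the sketch below. The adjacency threshold and scores are illustrative assumptions, not the operational rules examined in the article.

```python
# Two raters' scores for the same constructed responses, on a 1-4 rubric.
rater_1 = {"s01": 4, "s02": 2, "s03": 3, "s04": 1, "s05": 4}
rater_2 = {"s01": 3, "s02": 4, "s03": 3, "s04": 3, "s05": 4}

MAX_DISCREPANCY = 1  # scores differing by more than one point go to resolution

for student in rater_1:
    gap = abs(rater_1[student] - rater_2[student])
    status = "send to resolution scoring" if gap > MAX_DISCREPANCY else "report as is"
    print(f"{student}: rater scores {rater_1[student]}/{rater_2[student]} -> {status}")
```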
Traynor, A.; Merzdorf, H. E. – Educational Measurement: Issues and Practice, 2018
During the development of large-scale curricular achievement tests, recruited panels of independent subject-matter experts use systematic judgmental methods--often collectively labeled "alignment" methods--to rate the correspondence between a given test's items and the objective statements in a particular curricular standards document.…
Descriptors: Achievement Tests, Expertise, Alignment (Education), Test Items
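One widely cited summary of test-to-standards correspondence is Porter's alignment index, 1 - 0.5 * sum(|x_i - y_i|), where x and y are the proportions of test items and of standards objectives falling in each content category. The categories and counts below are invented, and the article's treatment of alignment methods is broader than this single statistic.

```python
# Hypothetical distributions of items and objectives across content categories.
content_categories = ["number", "algebra", "geometry", "data"]
items_per_category = [12, 10, 8, 10]       # how the test's items distribute
objectives_per_category = [9, 12, 9, 10]   # how the standards' objectives distribute

# Convert counts to proportions, then apply the index.
x = [c / sum(items_per_category) for c in items_per_category]
y = [c / sum(objectives_per_category) for c in objectives_per_category]

alignment = 1 - 0.5 * sum(abs(a - b) for a, b in zip(x, y))
print(f"Porter alignment index = {alignment:.3f}")  # 1.0 = identical distributions
```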