ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Descriptor

Generalizability Theory	5
Scoring	5
Test Interpretation	5
Error of Measurement	2
Essay Tests	2
Interrater Reliability	2
Mathematical Models	2
Scores	2
Test Construction	2
Test Validity	2
Ability	1
Analysis of Variance	1
Authentic Learning	1
Automation	1
Best Practices	1
Biology	1
Classroom Observation…	1
College Science	1
Computer Assisted Testing	1
Correlation	1
Data Analysis	1
Data Collection	1
Data Interpretation	1
Decision Making	1
Design	1
More ▼

Source

Applied Measurement in…	1
Educational Measurement:…	1
International Journal of…	1

Author

Anum Khushal	1
Brian A. Couch	1
Burton, Elizabeth	1
Haertel, Edward H.	1
Joseph Dauer	1
Linn, Robert L.	1
Lyrica Lucas	1
Robert Mayes	1
Rupp, André A.	1
Shale, Doug	1

Publication Type

Journal Articles	3
Speeches/Meeting Papers	3
Reports - Evaluative	2
Opinion Papers	1
Reports - Descriptive	1
Reports - Research	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Direct link

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Direct link

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

Performance-Based Assessment: Implications of Task Specificity.

Peer reviewed

Linn, Robert L.; Burton, Elizabeth – Educational Measurement: Issues and Practice, 1994

Generalizability of performance-based assessment scores across raters and tasks is examined, focusing on implications of generalizability analyses for specific uses and interpretations of assessment results. Although it seems probable that assessment conditions, task characteristics, and interactions with instructional experiences affect the…

Descriptors: Educational Assessment, Educational Experience, Generalizability Theory, Interaction

Latent Traits or Latent States? The Role of Discrete Models for Ability and Performance.

Download full text

Haertel, Edward H. – 1992

Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…

Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory

Essay Reliability: Form and Meaning.

Download full text

Shale, Doug – 1986

This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…

Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests