Showing all 9 results
Peer reviewed
Wind, Stefanie A.; Xu, Yangmeng – Educational Assessment, 2024
We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…
Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners
Peer reviewed
Wind, Stefanie A.; Guo, Wenjing – Educational Assessment, 2021
Scoring procedures for the constructed-response (CR) items in large-scale mixed-format educational assessments often involve checks for rater agreement or rater reliability. Although these analyses are important, researchers have documented rater effects that persist despite rater training and that are not always detected in rater agreement and…
Descriptors: Scoring, Responses, Test Items, Test Format
Peer reviewed
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Peer reviewed
Bell, Courtney A.; Jones, Nathan D.; Qi, Yi; Lewis, Jennifer M. – Educational Assessment, 2018
All 50 states use observations to evaluate practicing teachers, but we know little about how administrators actually reason when they use those observation protocols. Drawing on think-aloud and stimulated recall data, this study describes the types of strategies and warrants practicing administrators used when rating with their district's…
Descriptors: Administrators, Observation, Validity, Logical Thinking
Peer reviewed
Reckase, Mark D.; McCrory, Raven; Floden, Robert E.; Ferrini-Mundy, Joan; Senk, Sharon L. – Educational Assessment, 2015
Numerous researchers have suggested that teachers need multiple areas of mathematical knowledge and skill to be effective teachers of mathematics: knowledge of the mathematics that is the goal of instruction, advanced mathematics beyond the instructional material, and mathematical knowledge that is specific to what…
Descriptors: Algebra, Knowledge Base for Teaching, Multidimensional Scaling, Psychometrics
Peer reviewed
Reed, Deborah K. – Educational Assessment, 2011
This narrative synthesis reviews the psychometric properties of commercially and publicly available retell instruments used to assess the reading comprehension of students in grades K-12. Eleven instruments met selection criteria and were systematically coded for data related to the administration procedures, scoring procedures, and technical…
Descriptors: Reading Comprehension, Elementary Secondary Education, Construct Validity, Validity
Peer reviewed
Martinez, Jose Felipe; Borko, Hilda; Stecher, Brian; Luskin, Rebecca; Kloser, Matt – Educational Assessment, 2012
We report the results of a pilot validation study of the Quality Assessment in Science Notebook, a portfolio-like instrument for measuring teacher assessment practices in middle school science classrooms. A statewide sample of 42 teachers collected 2 notebooks during the school year, corresponding to science topics taught in the fall and spring.…
Descriptors: Validity, Middle School Teachers, Evaluation Methods, Educational Assessment
Peer reviewed
Kane, Michael – Educational Assessment, 1998
Examines criteria for choosing between test-centered and examinee-centered methods of standard setting, both in empirical terms and in terms of whether the method is consistent with the model of achievement underlying the test's design, interpretation, and assessment methods. Contains 35 references. (Author/SLD)
Descriptors: Academic Achievement, Criteria, Educational Assessment, Evaluation Methods
Peer reviewed
Dawson, Theo L.; Wilson, Mark – Educational Assessment, 2004
The evaluation of developmental interventions has been hampered by a lack of practical, reliable, and objective developmental assessment systems. This article describes the construction of a domain-general computerized developmental assessment system for texts: the Lexical Abstraction Assessment System (LAAS). The LAAS provides assessments of the…
Descriptors: Scoring, Evaluation Methods, Discriminant Analysis, Computer Uses in Education