ERIC - Search Results

Showing all 7 results

Appraising the Scoring Performance of Automated Essay Scoring Systems--Some Additional Considerations: Which Essays? Which Human Raters? Which Scores?

Peer reviewed

Raczynski, Kevin; Cohen, Allan – Applied Measurement in Education, 2018

The literature on Automated Essay Scoring (AES) systems has provided useful validation frameworks for any assessment that includes AES scoring. Furthermore, evidence for the scoring fidelity of AES systems is accumulating. Yet questions remain when appraising the scoring performance of AES systems. These questions include: (a) which essays are…

Descriptors: Essay Tests, Test Scoring Machines, Test Validity, Evaluators

The Association between Standards-Based Grading and Standardized Test Scores in a High School Reform Model

Peer reviewed

Pollio, Marty; Hochbein, Craig – Teachers College Record, 2015

Background/Context: From two decades of research on the grading practices of teachers in secondary schools, researchers discovered that teachers evaluated students on numerous factors that do not validly assess a student's achievement level in a specific content area. These consistent findings suggested that traditional grading practices evolved…

Descriptors: Standardized Tests, Academic Standards, Grading, Scores

Immediate Feedback and Opportunity to Revise Answers: Application of a Graded Response IRT Model

Peer reviewed

Attali, Yigal – Applied Psychological Measurement, 2011

Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…

Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction

A Note on Item-Restscore Association in Rasch Models

Peer reviewed

Kreiner, Svend – Applied Psychological Measurement, 2011

To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…

Descriptors: Item Analysis, Correlation, Item Response Theory, Models

Evaluation of the e-rater® Scoring Engine for the GRE® Issue and Argument Prompts. Research Report. ETS RR-12-02

Peer reviewed

Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012

Automated scoring models for the e-rater® scoring engine were built and evaluated for the GRE® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…

Descriptors: Scoring, Test Scoring Machines, Automation, Models

A Model for Evaluating the Assessment of Partial Knowledge.

Smith, Richard M. – 1982

There have been many attempts to formulate a procedure for extracting information from incorrect responses to multiple choice items, i.e., the assessment of partial knowledge. The results of these attempts can be described as inconsistent at best. It is hypothesized that these inconsistencies arise from three methodological problems: the…

Descriptors: Difficulty Level, Evaluation Methods, Goodness of Fit, Guessing (Tests)

Title I Evaluation Models A1 and B1: An Empirical Comparison.

Crane, Laura R.; Cech, Joseph – 1979

Normal curve equivalent achievement gains estimates were compared with RMC Title I evaluation Models A1 and B1. The comparison focused upon the amount of bias introduced by Model A1 when its underlying assumptions were violated. The model assumes, first, that the local school population is accurately represented by the national norm group; and…

Descriptors: Achievement Gains, Compensatory Education, Control Groups, Early Childhood Education