ERIC - Search Results

Showing all 8 results

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

The Effectiveness of Machine Score-Ability Ratings in Predicting Automated Scoring Performance

Peer reviewed

Lottridge, Susan; Wood, Scott; Shaw, Dan – Applied Measurement in Education, 2018

This study sought to provide a framework for evaluating machine score-ability of items using a new score-ability rating scale, and to determine the extent to which ratings were predictive of observed automated scoring performance. The study listed and described a set of factors that are thought to influence machine score-ability; these factors…

Descriptors: Program Effectiveness, Computer Assisted Testing, Test Scoring Machines, Scoring

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

Comparing Human and Automated Essay Scoring for Prospective Graduate Students with Learning Disabilities and/or ADHD

Peer reviewed

Buzick, Heather; Oliveri, Maria Elena; Attali, Yigal; Flor, Michael – Applied Measurement in Education, 2016

Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K-12 large-scale assessment. In this…

Descriptors: Essays, Learning Disabilities, Attention Deficit Hyperactivity Disorder, Scoring

A Comparison of the Generalizability of Scores Produced by Expert Raters and Automated Scoring Systems.

Peer reviewed

Clauser, Brian E.; Swanson, David B.; Clyman, Stephen G. – Applied Measurement in Education, 1999

Performed generalizability analyses of expert ratings and computer-produced scores for a computer-delivered performance assessment of physicians' patient management skills. The two automated scoring systems produced scores for the 200 medical students that were approximately as generalizable as those produced by the four expert raters. (SLD)

Descriptors: Comparative Analysis, Computer Assisted Testing, Generalizability Theory, Higher Education

Applications of Item Response Theory to Partial Credit Scoring.

Peer reviewed

Wise, Steven L., Ed.; And Others – Applied Measurement in Education, 1988

Six papers on the use of partial credit item response theory score models in applied measurement settings are presented. These applications include the scoring of medical certification examinations using computer-based patient simulations, narrative writing tests, and educational diagnosis. (TJH)

Descriptors: Clinical Diagnosis, Computer Assisted Testing, Computer Simulation, Educational Diagnosis

Practical Issues in Large-Scale Computerized Adaptive Testing.

Peer reviewed

Mills, Craig N.; Stocking, Martha L. – Applied Measurement in Education, 1996

Issues that must be addressed in the large-scale application of computerized adaptive testing are explored, including considerations of test design, scoring, test administration, item and item bank development, and other aspects of test construction. Possible solutions and areas in which additional work is needed are identified. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Higher Education

Development of a Scoring Algorithm To Replace Expert Rating for Scoring a Complex Performance-Based Assessment.

Peer reviewed

Clauser, Brian E.; Ross, Linette P.; Clyman, Stephen G.; Rose, Kathie M.; Margolis, Melissa J.; Nungester, Ronald J.; Piemme, Thomas E.; Chang, Lucy; El-Bayoumi, Gigi; Malakoff, Gary L.; Pincetl, Pierre S. – Applied Measurement in Education, 1997

Describes an automated scoring algorithm for a computer-based simulation examination of physicians' patient-management skills. Results with 280 medical students show that scores produced using this algorithm are highly correlated to actual clinician ratings. Scores were also effective in discriminating between case performance judged passing or…

Descriptors: Algorithms, Computer Assisted Testing, Computer Simulation, Evaluators
