ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	17

Descriptor

Computer Assisted Testing	17
Weighted Scores	17
Scoring	9
Comparative Analysis	8
Correlation	8
Prediction	7
Evaluation Methods	5
Regression (Statistics)	5
Test Items	5
Writing Evaluation	5
Elementary Secondary Education	4
Essay Tests	4
Grammar	4
Language Tests	4
Models	4
Multiple Regression Analysis	4
Scores	4
Second Language Learning	4
Automation	3
College Entrance Examinations	3
Computation	3
Construct Validity	3
English (Second Language)	3
Essays	3
Factor Analysis	3
More ▼

Source

ETS Research Report Series	7
Centers for Disease Control…	1
Education Sciences	1
Innovations in Education and…	1
International Association for…	1
International Association for…	1
International Journal of…	1
Journal of Technology,…	1
Language Testing	1
Pearson	1
Psychometrika	1
More ▼

Publication Type

Journal Articles	13
Reports - Research	11
Reports - Evaluative	4
Books	1
Collected Works - General	1
Collected Works - Proceedings	1
Collected Works - Serial	1
Speeches/Meeting Papers	1

Education Level

Higher Education	5
Elementary Secondary Education	4
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Grade 4	1
High Schools	1
Intermediate Grades	1

Audience

Location

Asia	1
Australia	1
Brazil	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Florida	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Israel	1
Italy	1
Japan	1
Kazakhstan	1
Netherlands	1
Norway	1
Ohio	1
Pakistan	1
Pennsylvania	1
Philippines	1
Portugal	1
Singapore	1
South Korea	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	4
Graduate Record Examinations	3
International Association for…	1
Progress in International…	1
Trends in International…	1
Youth Risk Behavior Survey	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Detecting the Impact of Remote Proctored At-Home Testing Using Propensity Score Weighting. Research Report. ETS RR-24-11

Peer reviewed
PDF on ERIC

Download full text

Jing Miao; Yi Cao; Michael E. Walker – ETS Research Report Series, 2024

Studies of test score comparability have been conducted at different stages in the history of testing to ensure that test results carry the same meaning regardless of test conditions. The expansion of at-home testing via remote proctoring sparked another round of interest. This study uses data from three licensure tests to assess potential mode…

Descriptors: Testing, Test Format, Computer Assisted Testing, Home Study

Youth Risk Behavior Surveillance--United States, 2023. Morbidity and Mortality Weekly Report (MMWR). Supplement. Vol. 73 No. 4

Download full text

Christine G. Casey, Editor – Centers for Disease Control and Prevention, 2024

The "Morbidity and Mortality Weekly Report" ("MMWR") series of publications is published by the Office of Science, Centers for Disease Control and Prevention (CDC), U.S. Department of Health and Human Services. Articles included in this supplement are: (1) Overview and Methods for the Youth Risk Behavior Surveillance System --…

Descriptors: High School Students, At Risk Students, Health Behavior, National Surveys

Implementing a Contributory Scoring Approach for the "GRE"® Analytical Writing Section: A Comprehensive Empirical Investigation. Research Report. ETS RR-17-14

Peer reviewed
PDF on ERIC

Download full text

Breyer, F. Jay; Rupp, André A.; Bridgeman, Brent – ETS Research Report Series, 2017

In this research report, we present an empirical argument for the use of a contributory scoring approach for the 2-essay writing assessment of the analytical writing section of the "GRE"® test in which human and machine scores are combined for score creation at the task and section levels. The approach was designed to replace a currently…

Descriptors: College Entrance Examinations, Scoring, Essay Tests, Writing Evaluation

Optimal Weighting for Exam Composition

Peer reviewed
PDF on ERIC

Download full text

Ganzfried, Sam; Yusuf, Farzana – Education Sciences, 2018

A problem faced by many instructors is that of designing exams that accurately assess the abilities of the students. Typically, these exams are prepared several days in advance, and generic question scores are used based on rough approximation of the question difficulty and length. For example, for a recent class taught by the author, there were…

Descriptors: Weighted Scores, Test Construction, Student Evaluation, Multiple Choice Tests

An Investigation of the "e-rater"® Automated Scoring Engine's Grammar, Usage, Mechanics, and Style Microfeatures and Their Aggregation Model. Research Report. ETS RR-17-04

Peer reviewed
PDF on ERIC

Download full text

Chen, Jing; Zhang, Mo; Bejar, Isaac I. – ETS Research Report Series, 2017

Automated essay scoring (AES) generally computes essay scores as a function of macrofeatures derived from a set of microfeatures extracted from the text using natural language processing (NLP). In the "e-rater"® automated scoring engine, developed at "Educational Testing Service" (ETS) for the automated scoring of essays, each…

Descriptors: Computer Assisted Testing, Scoring, Automation, Essay Tests

Reliability and Validity of International Large-Scale Assessment: Understanding IEA's Comparative Studies of Student Achievement. IEA Research for Education. Volume 10

Download full text

Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020

Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…

Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

Development of a Diagnostic and Remedial Learning System Based on an Enhanced Concept--Effect Model

Peer reviewed

Direct link

Panjaburees, Patcharin; Triampo, Wannapong; Hwang, Gwo-Jen; Chuedoung, Meechoke; Triampo, Darapond – Innovations in Education and Teaching International, 2013

With the rapid advances in computer technology during recent years, researchers have demonstrated the pivotal influences of computer-assisted diagnostic systems on student learning performance improvement. This research aims to develop a Diagnostic and Remedial Learning System (DRLS) for an algebra course in a Thai lower secondary school context…

Descriptors: Educational Diagnosis, Algebra, Secondary School Mathematics, Remedial Mathematics

Automated Trait Scores for "GRE"® Writing Tasks. Research Report. ETS RR-15-15

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these tasks, this study explored the value added of reporting 4 trait scores for each of these 2 tasks over the total e-rater score.…

Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

A Comparison of Two Scoring Methods for an Automated Speech Scoring System

Peer reviewed

Direct link

Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012

This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…

Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis

To Weight or Not to Weight? Balancing Influence of Initial Items in Adaptive Testing

Peer reviewed

Direct link

Chang, Hua-Hua; Ying, Zhiliang – Psychometrika, 2008

It has been widely reported that in computerized adaptive testing some examinees may get much lower scores than they would normally if an alternative paper-and-pencil version were given. The main purpose of this investigation is to quantitatively reveal the cause for the underestimation phenomenon. The logistic models, including the 1PL, 2PL, and…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Test Items

Performance of a Generic Approach in Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Bridgeman, Brent; Trapani, Catherine – Journal of Technology, Learning, and Assessment, 2010

A generic approach in automated essay scoring produces scores that have the same meaning across all prompts, existing or new, of a writing assessment. This is accomplished by using a single set of linguistic indicators (or features), a consistent way of combining and weighting these features into essay scores, and a focus on features that are not…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Test Scoring Machines

Correcting for Person Misfit in Aggregated Score Reporting

Peer reviewed

Direct link

Brown, Richard S.; Villarreal, Julio C. – International Journal of Testing, 2007

There has been considerable research regarding the extent to which psychometric sound assessments sometimes yield individual score estimates that are inconsistent with the response patterns of the individual. It has been suggested that individual response patterns may differ from expectations for a number of reasons, including subject motivation,…

Descriptors: Psychometrics, Test Bias, Testing, Simulation

On-the-Fly Customization of Automated Essay Scoring. Research Report. ETS RR-07-42

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal – ETS Research Report Series, 2007

Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…

Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring

Previous Page | Next Page »

Pages: 1 | 2

Attali, Yigal	5
Bridgeman, Brent	2
Sinharay, Sandip	2
Bejar, Isaac I.	1
Breyer, F. Jay	1
Brown, Richard S.	1
Chang, Hua-Hua	1
Chen, Jing	1
Chien, Yuehmei	1
Christine G. Casey, Editor	1
Chuedoung, Meechoke	1
Ganzfried, Sam	1
Higgins, Derrick	1
Hwang, Gwo-Jen	1
Jing Miao	1
Michael E. Walker	1
Panjaburees, Patcharin	1
Rupp, André A.	1
Shin, Chingwei David	1
Trapani, Catherine	1
Triampo, Darapond	1
Triampo, Wannapong	1
Villarreal, Julio C.	1
Wagemaker, Hans, Ed.	1
Way, Walter Denny	1
More ▼