ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	10

Descriptor

Evaluators	17
Test Validity	17
Second Language Learning	7
Computer Assisted Testing	6
Language Tests	6
Testing	6
English (Second Language)	5
Scoring	5
Test Reliability	5
Language Proficiency	4
Scores	4
Standardized Tests	4
Testing Programs	4
Accuracy	3
Elementary Secondary Education	3
Evaluation Criteria	3
Evaluation Methods	3
Factor Analysis	3
Foreign Countries	3
Higher Education	3
Interrater Reliability	3
Oral Language	3
Psychometrics	3
Student Evaluation	3
Test Construction	3
More ▼

Source

Language Testing	3
ETS Research Report Series	1
Educational Researcher	1
English Language Teaching	1
Grantee Submission	1
Language Assessment Quarterly	1
National Center for Education…	1
ProQuest LLC	1

Publication Type

Reports - Research	10
Journal Articles	7
Speeches/Meeting Papers	6
Reports - Evaluative	3
Opinion Papers	2
Tests/Questionnaires	2
Dissertations/Theses -…	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

California	1
China	1
United States	1
Vietnam	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
International English…	1
National Assessment of…	1
National Teacher Examinations	1
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Direct link

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Language Testing in China: Past and Future

Peer reviewed
PDF on ERIC

Download full text

Li, Xuelian – English Language Teaching, 2019

Based on the articles written by mainland Chinese scholars published in the most influential Chinese and international journals, the present article analyzed the language testing research, compared the tendencies of seven categories between 2000-2009 and 2010-2019, and put forward future research directions by referring to international hot…

Descriptors: Language Tests, Testing, Educational History, Futures (of Society)

Speaking in Turns and Sequences: Interactional Competence as a Target Construct in Testing Speaking

Peer reviewed

Direct link

Roever, Carsten; Kasper, Gabriele – Language Testing, 2018

In the assessment of speaking, a psycholinguistically based speaking construct has predominated. In this paper, we argue for the integration of the construct of interactional competence (IC) in speaking assessments to broaden the range of defensible inferences from speaking tests. IC emphasizes the co-constructed nature of interaction and enables…

Descriptors: Language Tests, Testing, Second Language Learning, Language Proficiency

A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses

Peer reviewed

Direct link

Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014

The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…

Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance

Automated Essay Evaluation for English Language Learners: A Case Study of "MY Access"

Peer reviewed

Direct link

Hoang, Giang Thi Linh; Kunnan, Antony John – Language Assessment Quarterly, 2016

Computer technology made its way into writing instruction and assessment with spelling and grammar checkers decades ago, but more recently it has done so with automated essay evaluation (AEE) and diagnostic feedback. And although many programs and tools have been developed in the last decade, not enough research has been conducted to support or…

Descriptors: Case Studies, Essays, Writing Evaluation, English (Second Language)

Whether and How to Use State Tests to Measure Student Achievement in a Multi-State Randomized Experiment: An Empirical Assessment Based on Four Recent Evaluations. NCEE 2012-4015

Peer reviewed
PDF on ERIC

Download full text

Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011

This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…

Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness

Lecture:"Where We Have Been and Where We Should Go"

Peer reviewed

Direct link

Stansfield, Charles W. – Language Testing, 2008

In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…

Descriptors: History, Testing, Language Tests, Role

Issues Related to Test Use.

Anderson, Scarvia B. – 1977

Several issues are related to the use of educational tests. First, test users must be able to choose appropriate tests, interpret scores, and make decisions based on scores. In the field of educational testing, few test users have adequate training in these areas. Second, test makers must clearly specify directions for administration, allowable…

Descriptors: Educational Testing, Elementary Secondary Education, Evaluators, Guides

The Impact of World Englishes on Language Assessment: Rater Attitude, Rating Behavior, and Challenges

Direct link

Hsu, Huei-Lien – ProQuest LLC, 2012

By centralizing the issue of test fairness in language proficiency assessments, this study responds to a call by researchers for developing greater social responsibility in the language testing agenda. As inquiries into language attitude and psychology indicate, there is an underlying uncertainty pertaining to the validity of test use and score…

Descriptors: Language Variation, English (Second Language), Second Language Learning, Mixed Methods Research

Reconstructing Validity

Peer reviewed

Direct link

Moss, Pamela A. – Educational Researcher, 2007

In response to Lissitz and Samuelsen (2007), the author reconstructs the historical arguments for the more comprehensive unitary concept of validity and the principles of scientific inquiry underlying it. Her response is organized in terms of four questions: (a) How did validity in educational measurement come to be conceptualized as unitary, and…

Descriptors: Evaluators, Construct Validity, Test Validity, Measurement

The Role of Evaluation and Plans for Evaluating the Current Testing Program.

Winters, Lynn – 1981

The Palos Verdes Peninsula Unified School District Office of Program Evaluation and Research is responsible for providing information for program development and improvement; providing test information to special programs coordinators; and acting as a clearinghouse for all information concerning tests, evaluation methodology, and educational…

Descriptors: Elementary Secondary Education, Evaluation Criteria, Evaluation Methods, Evaluators

Toward an Understanding of the Role of Speech Recognition in Nonnative Speech Assessment. TOEFL iBT Research Report. TOEFL iBT-02. ETS RR-07-02

Peer reviewed
PDF on ERIC

Download full text

Zechner, Klaus; Bejar, Isaac I.; Hemat, Ramin – ETS Research Report Series, 2007

The increasing availability and performance of computer-based testing has prompted more research on the automatic assessment of language and speaking proficiency. In this investigation, we evaluated the feasibility of using an off-the-shelf speech-recognition system for scoring speaking prompts from the LanguEdge field test of 2002. We first…

Descriptors: Role, Computer Assisted Testing, Language Proficiency, Oral Language

The Assessment of Writing Proficiency via Qualitative Ratings of Writing Samples.

Steele, Joe M. – 1979

The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…

Descriptors: Adults, Essay Tests, Evaluators, Higher Education

Designing a Review and Appeal Process for a Large Scale Writing Assessment Program.

Goldberg, Gail Lynn; Walker-Bartnick, Leslie – 1989

A formative evaluation of a pilot review and appeal process for the Maryland Writing Test (MWT) is described. The MWT is a large-scale direct assessment of writing. A three-year pilot phase was to culminate in the implementation of an operational procedure for the review and appeal of scores impacting a pass/fail decision for examinees. The MWT…

Descriptors: Cutting Scores, Evaluators, Formative Evaluation, Graduation Requirements

Metacognition of Performance Raters.

Download full text

Littlefield, John H.; And Others – 1985

Sixteen Family Practice faculty members completed ratings on 59 senior medical students after a 6-week primary care clerkship. Each student was rated by seven to ten faculty members and the chief residents who worked with them, resulting in a total of 353 ratings. The rating scale covered: (1) attainment of learning objectives; (2) progress during…

Descriptors: Analysis of Variance, Clinical Experience, Confidence Testing, Evaluators

Previous Page | Next Page »

Pages: 1 | 2

Anderson, Scarvia B.	1
Bejar, Isaac I.	1
Busch, John Christian	1
Denis Dumas	1
Goldberg, Gail Lynn	1
Hemat, Ramin	1
Hoang, Giang Thi Linh	1
Hsu, Huei-Lien	1
Jaeger, Richard M.	1
Kasper, Gabriele	1
Kelly Berthiaume	1
Kunnan, Antony John	1
Li, Xuelian	1
Ling, Guangming	1
Littlefield, John H.	1
Mollaun, Pamela	1
Moss, Pamela A.	1
Peter Organisciak	1
Roever, Carsten	1
Selcuk Acar	1
Shiflett, Samuel	1
Somers, Marie-Andree	1
Stansfield, Charles W.	1
Steele, Joe M.	1
More ▼