Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 10 |
Descriptor
Evaluators | 17 |
Test Validity | 17 |
Second Language Learning | 7 |
Computer Assisted Testing | 6 |
Language Tests | 6 |
Testing | 6 |
English (Second Language) | 5 |
Scoring | 5 |
Test Reliability | 5 |
Language Proficiency | 4 |
Scores | 4 |
More ▼ |
Source
Language Testing | 3 |
ETS Research Report Series | 1 |
Educational Researcher | 1 |
English Language Teaching | 1 |
Grantee Submission | 1 |
Language Assessment Quarterly | 1 |
National Center for Education… | 1 |
ProQuest LLC | 1 |
Author
Publication Type
Reports - Research | 10 |
Journal Articles | 7 |
Speeches/Meeting Papers | 6 |
Reports - Evaluative | 3 |
Opinion Papers | 2 |
Tests/Questionnaires | 2 |
Dissertations/Theses -… | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
California | 1 |
China | 1 |
United States | 1 |
Vietnam | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
International English… | 1 |
National Assessment of… | 1 |
National Teacher Examinations | 1 |
Torrance Tests of Creative… | 1 |
What Works Clearinghouse Rating
Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024
Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…
Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques
Li, Xuelian – English Language Teaching, 2019
Based on the articles written by mainland Chinese scholars published in the most influential Chinese and international journals, the present article analyzed the language testing research, compared the tendencies of seven categories between 2000-2009 and 2010-2019, and put forward future research directions by referring to international hot…
Descriptors: Language Tests, Testing, Educational History, Futures (of Society)
Roever, Carsten; Kasper, Gabriele – Language Testing, 2018
In the assessment of speaking, a psycholinguistically based speaking construct has predominated. In this paper, we argue for the integration of the construct of interactional competence (IC) in speaking assessments to broaden the range of defensible inferences from speaking tests. IC emphasizes the co-constructed nature of interaction and enables…
Descriptors: Language Tests, Testing, Second Language Learning, Language Proficiency
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Hoang, Giang Thi Linh; Kunnan, Antony John – Language Assessment Quarterly, 2016
Computer technology made its way into writing instruction and assessment with spelling and grammar checkers decades ago, but more recently it has done so with automated essay evaluation (AEE) and diagnostic feedback. And although many programs and tools have been developed in the last decade, not enough research has been conducted to support or…
Descriptors: Case Studies, Essays, Writing Evaluation, English (Second Language)
Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011
This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…
Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness
Stansfield, Charles W. – Language Testing, 2008
In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…
Descriptors: History, Testing, Language Tests, Role
Anderson, Scarvia B. – 1977
Several issues are related to the use of educational tests. First, test users must be able to choose appropriate tests, interpret scores, and make decisions based on scores. In the field of educational testing, few test users have adequate training in these areas. Second, test makers must clearly specify directions for administration, allowable…
Descriptors: Educational Testing, Elementary Secondary Education, Evaluators, Guides
Hsu, Huei-Lien – ProQuest LLC, 2012
By centralizing the issue of test fairness in language proficiency assessments, this study responds to a call by researchers for developing greater social responsibility in the language testing agenda. As inquiries into language attitude and psychology indicate, there is an underlying uncertainty pertaining to the validity of test use and score…
Descriptors: Language Variation, English (Second Language), Second Language Learning, Mixed Methods Research
Moss, Pamela A. – Educational Researcher, 2007
In response to Lissitz and Samuelsen (2007), the author reconstructs the historical arguments for the more comprehensive unitary concept of validity and the principles of scientific inquiry underlying it. Her response is organized in terms of four questions: (a) How did validity in educational measurement come to be conceptualized as unitary, and…
Descriptors: Evaluators, Construct Validity, Test Validity, Measurement
Winters, Lynn – 1981
The Palos Verdes Peninsula Unified School District Office of Program Evaluation and Research is responsible for providing information for program development and improvement; providing test information to special programs coordinators; and acting as a clearinghouse for all information concerning tests, evaluation methodology, and educational…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Evaluation Methods, Evaluators
Zechner, Klaus; Bejar, Isaac I.; Hemat, Ramin – ETS Research Report Series, 2007
The increasing availability and performance of computer-based testing has prompted more research on the automatic assessment of language and speaking proficiency. In this investigation, we evaluated the feasibility of using an off-the-shelf speech-recognition system for scoring speaking prompts from the LanguEdge field test of 2002. We first…
Descriptors: Role, Computer Assisted Testing, Language Proficiency, Oral Language
Steele, Joe M. – 1979
The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…
Descriptors: Adults, Essay Tests, Evaluators, Higher Education
Goldberg, Gail Lynn; Walker-Bartnick, Leslie – 1989
A formative evaluation of a pilot review and appeal process for the Maryland Writing Test (MWT) is described. The MWT is a large-scale direct assessment of writing. A three-year pilot phase was to culminate in the implementation of an operational procedure for the review and appeal of scores impacting a pass/fail decision for examinees. The MWT…
Descriptors: Cutting Scores, Evaluators, Formative Evaluation, Graduation Requirements
Littlefield, John H.; And Others – 1985
Sixteen Family Practice faculty members completed ratings on 59 senior medical students after a 6-week primary care clerkship. Each student was rated by seven to ten faculty members and the chief residents who worked with them, resulting in a total of 353 ratings. The rating scale covered: (1) attainment of learning objectives; (2) progress during…
Descriptors: Analysis of Variance, Clinical Experience, Confidence Testing, Evaluators
Previous Page | Next Page ยป
Pages: 1 | 2