ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	16

Descriptor

Scores	27
Test Format	27
Scoring	17
Test Items	12
Computer Assisted Testing	10
Test Construction	9
Test Validity	9
Higher Education	8
College Students	6
Correlation	6
Language Tests	6
Multiple Choice Tests	6
Second Language Learning	6
English (Second Language)	5
Foreign Countries	5
Scoring Rubrics	5
Test Use	5
Testing	5
Achievement Tests	4
Difficulty Level	4
Educational Assessment	4
Evaluation Methods	4
Language Proficiency	4
Mathematics Tests	4
Psychometrics	4
More ▼

Source

Practical Assessment,…	3
Journal of Educational…	2
Language Testing	2
College Board	1
College Entrance Examination…	1
Council for Aid to Education	1
Education and Information…	1
Evaluation and the Health…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational…	1
Journal of Pan-Pacific…	1
Language Assessment Quarterly	1
National Center for the…	1
ProQuest LLC	1
Rowman & Littlefield…	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	15
Reports - Descriptive	6
Reports - Evaluative	4
Speeches/Meeting Papers	4
Tests/Questionnaires	2
Books	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Higher Education	8
Postsecondary Education	6
Elementary Secondary Education	1
High Schools	1
Secondary Education	1

Audience

Location

China	1
France	1
Iran	1
Israel	1
Italy	1
Louisiana	1
Missouri	1
North Dakota	1
Tennessee	1
United States	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Graduate Record Examinations	2
Test of English as a Foreign…	2
ACT Assessment	1
College Board Achievement…	1
Computer Attitude Scale	1
Foreign Language Classroom…	1
National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Hanyu Shuiping Kaoshi (HSK): A Multi-Level, Multi-Purpose Proficiency Test

Peer reviewed

Direct link

Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021

This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…

Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Download full text

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Proficiency at the Lexis-Grammar Interface: Comparing Oral versus Written French Exam Tasks

Peer reviewed

Direct link

Vandeweerd, Nathan; Housen, Alex; Paquot, Magali – Language Testing, 2023

This study investigates whether re-thinking the separation of lexis and grammar in language testing could lead to more valid inferences about proficiency across modes. As argued by Römer, typical scoring rubrics ignore important information about proficiency encoded at the lexis-grammar interface, in particular how the co-selection of lexical and…

Descriptors: French, Language Tests, Grammar, Second Language Learning

Investigation of 2018 ACT Score Declines Final Report

Download full text

Keng, Leslie; Boyer, Michelle – National Center for the Improvement of Educational Assessment, 2020

ACT requested assistance from the National Center for the Improvement of Educational Assessment (Center for Assessment) to investigate declines of scores for states administering the ACT to its 11th grade students in 2018. This request emerged from conversations among state leaders, the Center for Assessment, and ACT in trying to understand the…

Descriptors: College Entrance Examinations, Scores, Test Score Decline, Educational Trends

Integrated Listening/Speaking Skill Assessment: The Role of Ambiguity Tolerance, Cognitive/Metacognitive Strategy Use, and Foreign Language Anxiety

Peer reviewed
PDF on ERIC

Download full text

Karim Sadeghi; Neda Bakhshi – International Journal of Language Testing, 2025

Assessing language skills in an integrative form has drawn the attention of assessment experts in recent years. While some research data exists on integrative listening/reading-to-write assessment, there is comparatively little research literature on listening-to-speak integrated assessment. Also, little attention has been devoted to the role of…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Computer Assisted Testing

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Fairness Concerns of Discrete Option Multiple Choice Items

Peer reviewed
PDF on ERIC

Download full text

Eckerly, Carol; Smith, Russell; Sowles, John – Practical Assessment, Research & Evaluation, 2018

The Discrete Option Multiple Choice (DOMC) item format was introduced by Foster and Miller (2009) with the intent of improving the security of test content. However, by changing the amount and order of the content presented, the test taking experience varies by test taker, thereby introducing potential fairness issues. In this paper we…

Descriptors: Culture Fair Tests, Multiple Choice Tests, Testing, Test Items

The New Computer Adaptive Test of Size and Strength (CATSS): Development and Validation

Peer reviewed

Direct link

Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019

This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…

Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability

Measuring Multidimensional Science Learning: Item Design, Scoring, and Psychometric Considerations

Direct link

Castle, Courtney – ProQuest LLC, 2018

The Next Generation Science Standards propose a multidimensional model of science learning, comprised of Core Disciplinary Ideas, Science and Engineering Practices, and Crosscutting Concepts (NGSS Lead States, 2013). Accordingly, there is a need for student assessment aligned with the new standards. Creating assessments that validly and reliably…

Descriptors: Science Education, Student Evaluation, Science Tests, Test Construction

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Download full text

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

A Case Study of an International Performance-Based Assessment of Critical Thinking Skills

Download full text

Wolf, Raffaela; Zahner, Doris; Kostoris, Fiorella; Benjamin, Roger – Council for Aid to Education, 2014

The measurement of higher-order competencies within a tertiary education system across countries presents methodological challenges due to differences in educational systems, socio-economic factors, and perceptions as to which constructs should be assessed (Blömeke, Zlatkin-Troitschanskaia, Kuhn, & Fege, 2013). According to Hart Research…

Descriptors: Case Studies, International Assessment, Performance Based Assessment, Critical Thinking

Gating Items: Definition, Significance, and Need for Further Study

Peer reviewed

Direct link

Judd, Wallace – Practical Assessment, Research & Evaluation, 2009

Over the past twenty years in performance testing a specific item type with distinguishing characteristics has arisen time and time again. It's been invented independently by dozens of test development teams. And yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…

Descriptors: Test Items, Test Format, Evaluation, Item Analysis

Fundamental Concerns in High-Stakes Language Testing: The Case of the College English Test

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jin, Yan – Journal of Pan-Pacific Association of Applied Linguistics, 2011

The College English Test (CET) is an English language test designed for educational purposes, administered on a very large scale, and used for making high-stakes decisions. This paper discusses the key issues facing the CET during the course of its development in the past two decades. It argues that the most fundamental and critical concerns of…

Descriptors: High Stakes Tests, Language Tests, Measures (Individuals), Graduates

Classroom Assessment in Action

Direct link

Shermis, Mark D.; DiVesta, Francis J. – Rowman & Littlefield Publishers, Inc., 2011

"Classroom Assessment in Action" clarifies the multi-faceted roles of measurement and assessment and their applications in a classroom setting. Comprehensive in scope, Shermis and Di Vesta explain basic measurement concepts and show students how to interpret the results of standardized tests. From these basic concepts, the authors then…

Descriptors: Student Evaluation, Standardized Tests, Scores, Measurement

Previous Page | Next Page »

Pages: 1 | 2

Allen, Nancy L.	1
Arkin, Robert M.	1
Aviad-Levitzky, Tami	1
Baldwin, Peter	1
Barrett, Michael J.	1
Benjamin, Roger	1
Bennett, Randy Elliot	1
Boyer, Michelle	1
Bridgeman, Brent	1
Castle, Courtney	1
Cheng, Liying	1
Clariana, Roy B.	1
Clauser, Brian E.	1
DiVesta, Francis J.	1
Eckerly, Carol	1
Engelhard, George, Jr.	1
Goldstein, Zahava	1
Hambleton, Ronald K.	1
Harasym, P. H.	1
Housen, Alex	1
Jin, Yan	1
Judd, Wallace	1
Karim Sadeghi	1
Keng, Leslie	1
More ▼