ERIC - Search Results

Showing 31 to 45 of 506 results

Test Design and Validity Evidence of Interactive Speaking Assessment in the Era of Emerging Technologies

Peer reviewed

Jung Youn, Soo – Language Testing, 2023

As access to smartphones and emerging technologies has become ubiquitous in our daily lives and in language learning, technology-mediated social interaction has become common in teaching and assessing L2 speaking. The changing ecology of L2 spoken interaction provides language educators and testers with opportunities for renewed test design and…

Descriptors: Test Construction, Test Validity, Second Language Learning, Telecommunications

Evaluating Assessment Score Validity and Characterizing Undergraduate Biology Exam Content

Crystal Uminski – ProQuest LLC, 2023

The landscape of undergraduate biology education has been shaped by decades of reform efforts calling for instruction to integrate core concepts and scientific skills as a means of helping students become proficient in the discipline. Assessments can be used to make inferences about how these reform efforts have translated into changes in…

Descriptors: Undergraduate Students, Biology, Science Instruction, Science Tests

The Structural and Convergent Validity of the FMS² Assessment Tool among 8- to 12-Year-Old Children

Peer reviewed

Nathan Gavigan; Sarahjane Belton; Una Britton; Shane Dalton; Johann Issartel – European Physical Education Review, 2024

Although there is a plethora of tools available to assess children's movement competence (MC), the literature suggests that many have significant limitations (e.g. not being practical for use in many 'real-world' settings). The FMS² assessment tool has recently been developed as a targeted solution to many of the existing barriers…

Descriptors: Test Validity, Test Format, Children, Evaluation

The Canadian English Language Proficiency Index Program (CELPIP) Test

Peer reviewed

McLeod, Melissa; Cheng, Liying – Language Assessment Quarterly, 2023

The Canadian English Language Proficiency Index Program (CELPIP) Test was designed for immigration and citizenship in Canada. CELPIP is a computer-based English-language proficiency test which covers all four skills. This test review provides a description of the test and its construct, tasks, and delivery. Then, it appraises CELPIP for…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended-duration versus closed-book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30 × free-response short answer to a question, SAQ) and end-of-year paper (4 × SAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

Validity Evidence for Forced-Choice and Mixed-Format Knowledge Assessments

Peer reviewed

Cari F. Herrmann Abell – Grantee Submission, 2021

In the last twenty-five years, the discussion surrounding validity evidence has shifted both in language and scope, from the work of Messick and Kane to the updated Standards for Educational and Psychological Testing. However, these discussions haven't necessarily focused on best practices for different types of instruments or assessments, taking…

Descriptors: Test Format, Measurement Techniques, Student Evaluation, Rating Scales

Innovations in Assessing Students' Digital Literacy Skills in Learning Science: Effective Multiple Choice Closed-Ended Tests Using Rasch Model

Peer reviewed

Fitria Lafifa; Dadan Rosana – Turkish Online Journal of Distance Education, 2024

This research aims to develop a multiple-choice closed-ended test to assess and evaluate students' digital literacy skills. The sample in this study consisted of students at MTsN 1 Blitar City who were selected using a purposive sampling technique. The test was also validated by experts, namely 2 Doctors of Physics and Science from Yogyakarta State…

Descriptors: Educational Innovation, Student Evaluation, Digital Literacy, Multiple Choice Tests

Why Teaching? A Validation of the FIT-Choice Scale in the Serbian Context

Peer reviewed

Simić, Nataša; Marušić Jablanović, Milica; Grbić, Sanja – Journal of Education for Teaching: International Research and Pedagogy, 2022

The aim of this study was to validate the structure of the "FIT-Choice scale" on a Serbian sample of pre-service teachers, as well as to determine the motivations and beliefs about the teaching profession, and test if motivation differs across different groups of pre-service teachers. After prospective class and subject teachers…

Descriptors: Foreign Countries, Likert Scales, Factor Structure, Factor Analysis

Improving Student Understanding of Quantum Measurement in Infinite-Dimensional Hilbert Space Using a Research-Based Multiple-Choice Question Sequence

Peer reviewed

Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025

Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…

Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests

Hanyu Shuiping Kaoshi (HSK): A Multi-Level, Multi-Purpose Proficiency Test

Peer reviewed

Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021

This test review focuses on the current version (2009) of 汉语水平考试 (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…

Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making

Reviewing the Structure of Kolb's Learning Style Inventory from Factor Analysis and Thurstonian Item Response Theory (IRT) Model Approaches

Peer reviewed

Calderón Carvajal, Carlos; Ximénez Gómez, Carmen; Lay-Lisboa, Siu; Briceño, Mauricio – Journal of Psychoeducational Assessment, 2021

Kolb's Learning Style Inventory (LSI) continues to generate a great debate among researchers, given the contradictory evidence resulting from its psychometric properties. One primary criticism focuses on the artificiality of the results derived from its internal structure because of the ipsative nature of the forced-choice format. This study seeks…

Descriptors: Factor Structure, Psychometrics, Test Format, Test Validity

Evaluating the Explanation Inference of a High-Stakes French Listening Test: An Argument-Based Perspective

Peer reviewed

Arias, Angel; Blais, Jean-Guy – Canadian Modern Language Review, 2023

This article draws on argument-based validation to gather and evaluate construct-related evidence (i.e., the explanation inference) of a high-stakes test. The data stemmed from the listening component of a French test used for immigration to Canada through the province of Quebec. An expert panel with varied backgrounds in applied linguistics…

Descriptors: French, Listening Comprehension Tests, Second Language Learning, High Stakes Tests

Review of Problem-Solving Measurement: An Assessment Developed in the Indonesian Context

Peer reviewed

Wicaksono, Azizul Ghofar Candra; Korom, Erzsébet – Participatory Educational Research, 2022

The accuracy of learning results relies on evaluation and assessment. The learning goals, including problem-solving ability, must be aligned with valid standardized measurement tools. The study on exploring the nature of problem-solving, framework, and assessment in the Indonesian context will make contributions to problem solving…

Descriptors: Problem Solving, Educational Research, Test Construction, Test Validity

Issues and Concerns in Classroom Assessment Practices

Areekkuzhiyil, Santhosh – Online Submission, 2021

Assessment is an integral part of any teaching-learning process. Assessment has a large number of functions to perform, whether it is formative or summative. This paper analyses the issues involved and the areas of concern in classroom assessment practice and discusses the recent reforms that have taken place. [This paper was published in Edutracks v20 n8…

Descriptors: Student Evaluation, Formative Evaluation, Summative Evaluation, Test Validity

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
