ERIC - Search Results

Publication Date

In 2025	4
Since 2024	12
Since 2021 (last 5 years)	25
Since 2016 (last 10 years)	60
Since 2006 (last 20 years)	100

Descriptor

Test Construction	892
Test Use	892
Elementary Secondary Education	283
Test Validity	239
Educational Assessment	228
Student Evaluation	176
Test Reliability	143
Evaluation Methods	138
Higher Education	137
Testing Programs	135
Performance Based Assessment	131
Standardized Tests	113
Scoring	112
Foreign Countries	110
Test Items	109
Testing Problems	108
Achievement Tests	99
State Programs	90
Language Tests	86
Testing	79
Academic Achievement	78
Test Interpretation	77
Educational Testing	72
Psychometrics	71
Scores	70
More ▼

Education Level

Higher Education	25
Postsecondary Education	23
Elementary Secondary Education	16
Elementary Education	14
Secondary Education	10
Early Childhood Education	6
Primary Education	6
Grade 3	4
Middle Schools	4
Adult Education	3
Grade 4	3
Grade 5	3
Grade 6	3
High Schools	3
Intermediate Grades	3
Junior High Schools	3
Grade 7	2
Grade 8	2
Kindergarten	2
Adult Basic Education	1
Grade 1	1
Grade 2	1
More ▼

Audience

Practitioners	117
Teachers	74
Administrators	23
Researchers	23
Students	15
Policymakers	11
Parents	7
Community	3
Counselors	2

Location

Australia	18
Canada	12
United Kingdom	8
United States	8
Japan	7
New York	7
Pennsylvania	7
Texas	7
China	6
Israel	6
Ohio	6
California	5
Netherlands	5
Oregon	5
Sweden	5
United Kingdom (England)	5
Colorado	4
Georgia	4
Kentucky	4
New Jersey	4
North Carolina	4
United Kingdom (Great Britain)	4
Washington	4
Alaska	3
Illinois	3
More ▼

Laws, Policies, & Programs

Improving Americas Schools…	4
No Child Left Behind Act 2001	3
Comprehensive Education…	2
Education Consolidation…	2
Every Student Succeeds Act…	2
Race to the Top	2
Education for All Handicapped…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Kentucky Education Reform Act…	1
National Defense Education Act	1
Rehabilitation Act 1973…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 892 results Save | Export

Language Testers and Their Place in the Policy Web

Peer reviewed

Direct link

Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024

In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…

Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy

Improving Balance in Educational Measurement: A Legacy of E. F. Lindquist

Peer reviewed

Direct link

Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024

A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…

Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History

Hold the Bets! Should Quasi-Experiments Be Preferred to True Experiments When Causal Generalization Is the Goal?

Peer reviewed

Direct link

Andrew P. Jaciw – American Journal of Evaluation, 2025

By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…

Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias

Moving the Field of Vocabulary Assessment Forward: The Need for More Rigorous Test Development and Validation

Peer reviewed

Direct link

Schmitt, Norbert; Nation, Paul; Kremmel, Benjamin – Language Teaching, 2020

Recently, a large number of vocabulary tests have been made available to language teachers, testers, and researchers. Unfortunately, most of them have been launched with inadequate validation evidence. The field of language testing has become increasingly more rigorous in the area of test validation, but developers of vocabulary tests have…

Descriptors: Test Construction, Test Validity, Language Tests, Test Use

Does Students' Understanding of Test Designs and Demands of a High-Stakes Test Influence Their Learning Practices: A Washback Study

Peer reviewed

Direct link

Manxia Dong; Boyu Wang – Language Testing in Asia, 2025

This study aimed to explore the relationship between students' understanding of the National Matriculation English Test (NMET) and their learning practices through standard multiple regression (SMR) and structural equation modeling (SEM) with the purpose of unraveling the working mechanism of washback. A total number of 3105 Chinese senior high…

Descriptors: Foreign Countries, High School Seniors, Test Construction, Test Use

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

Literacy, Social Justice and Inclusion: A Large-Scale Design Experiment to Narrow the Attainment Gap Linked to Poverty

Peer reviewed

Direct link

Ellis, Sue; Rowe, Adele – Support for Learning, 2020

This paper describes the development and use of a tool designed to support educators to use a broad range of professional knowledge to enable inclusive literacy teaching that delivers social justice and narrows the attainment gap associated with poverty. The tool encourages teachers to formally recognise and act on a wide range of evidence about…

Descriptors: Literacy, Social Justice, Inclusion, Achievement Gap

Revisiting Rating Scale Development for Rater-Mediated Language Performance Assessments: Modelling Construct and Contextual Choices Made by Scale Developers

Peer reviewed

Direct link

Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021

Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…

Descriptors: Rating Scales, Test Construction, Language Tests, Test Use

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Direct link

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Application of the International Classification of Functioning, Disability, and Health in Autism and Attention-Deficit Hyperactivity Disorder: A Scoping Review

Peer reviewed

Direct link

Lovisa Alehagen; Sven Bölte; Melissa H Black – Autism: The International Journal of Research and Practice, 2025

The International Classification of Functioning, Disability, and Health is a biopsychosocial framework of health-related functioning designed to provide a unifying system for health care, social services, education, and policy sectors. Since its publication in 2001, the International Classification of Functioning has been used to guide clinical…

Descriptors: Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Classification, Functional Behavioral Assessment

Minority Language Testing: The Social Impact of the Zhuang Language Proficiency Test in China

Peer reviewed

Direct link

Ying Wu; Rita Elaine Silver; Guangwei Hu – Journal of Multilingual and Multicultural Development, 2024

The Zhuang language test ("Vahcuengh Sawcuengh Suijbingz Gaujsi", VSSG) is the first minority language test in the People's Republic of China. It was designed with multiple goals including improving Zhuang language teaching, recruiting students for relevant majors of tertiary study, identifying proficiency for work-related applications,…

Descriptors: Language Minorities, Language Tests, Second Language Learning, Second Language Instruction

A Critique on Discourse of Language Tests

Peer reviewed
PDF on ERIC

Download full text

Karatas, Tuçe Öztürk – Education Quarterly Reviews, 2021

In the 21st century, with the rise of the popularity of standardized or large-scale tests, their high-stakes have started to be apparent. High-stake tests are not new, but in most cases, their current use as social practice tends to shape individuals' futures. Currently the new trend for their quality discussion aims to critically evaluate tests…

Descriptors: Language Tests, Standardized Tests, High Stakes Tests, Test Use

Suggesting a Policy-Driven Approach to Validation in the Context of the Test of Proficiency in Korean (TOPIK)

Direct link

Im, Gwan-Hyeok; Shin, Dongil; Park, Soohyeon – Current Issues in Language Planning, 2022

This study suggests a conceptual framework for policy-driven test development and validation, using the Test of Proficiency in Korean (TOPIK) as an example context. By linking the literature on policy analysis and argument structure in the validation of testing, the strong relationships between policy and testing are illustrated. This rationalizes…

Descriptors: Language Proficiency, Language Tests, Korean, Test Construction

Equivalency Evidence of the English Competency Test across Different Modes: A Rasch Analysis

Peer reviewed

Direct link

Muhammad Yoga Prabowo; Sarah Rahmadian – TEFLIN Journal: A publication on the teaching and learning of English, 2023

The outbreak of the COVID-19 pandemic has transformed the educational landscape in a way unseen before. Educational institutions are navigating between offline and online learning worldwide. Computer-based testing is rapidly taking over paper-and-pencil testing as the dominant mode of assessment. In some settings, computer-based and…

Descriptors: English (Second Language), Second Language Learning, Test Format, Language Tests

Strengthening the Foundation of Educational Psychology by Integrating Construct Validation into Open Science Reform

Peer reviewed

Direct link

Flake, Jessica Kay – Educational Psychologist, 2021

An increased focus on transparency and replication in science has stimulated reform in research practices and dissemination. As a result, the research culture is changing: the use of preregistration is on the rise, access to data and materials is increasing, and large-scale replication studies are more common. In this article, I discuss two…

Descriptors: Educational Psychology, Construct Validity, Access to Information, Test Construction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 60

Educational Measurement:…	38
Psychological Assessment	19
Educational and Psychological…	17
Applied Measurement in…	14
Journal of Educational…	12
Studies in Educational…	8
Language Testing	7
Evaluation and Program…	6
Educational Assessment	5
Evaluation and the Health…	5
Journal of Personnel…	5
Psychological Test Bulletin	5
American Journal of…	4
Educational Researcher	4
International Journal of…	4
International Journal of…	4
Journal of Educational…	4
Measurement and Evaluation in…	4
ProQuest LLC	4
Academic Medicine	3
Adolescence	3
American Journal of Education	3
Assessment in Education:…	3
ETS Research Report Series	3
Intelligence	3
More ▼

Baker, Eva L.	7
Bond, Linda	5
Hambleton, Ronald K.	5
Herman, Joan L.	5
Linn, Robert L.	5
Marso, Ronald N.	5
Airasian, Peter W.	4
Ediger, Marlow	4
Pigge, Fred L.	4
Roeber, Edward	4
Shepard, Lorrie A.	4
Straus, Murray A.	4
Arter, Judy	3
Bennett, Randy Elliot	3
Danielson, Charlotte	3
Fraser, Barry J.	3
Gearhart, Maryl	3
Glaser, Robert	3
Green, Donald Ross	3
Green, Kathy E.	3
Hogan, Thomas P.	3
Mehrens, William A.	3
Mullis, Ina V. S.	3
Nichols, Paul D.	3
More ▼

Journal Articles	320
Reports - Evaluative	259
Speeches/Meeting Papers	212
Reports - Research	201
Reports - Descriptive	180
Guides - Non-Classroom	99
Books	66
Opinion Papers	66
Tests/Questionnaires	59
Information Analyses	54
Guides - Classroom - Teacher	24
Book/Product Reviews	20
Collected Works - General	20
Numerical/Quantitative Data	20
ERIC Digests in Full Text	12
ERIC Publications	12
Collected Works - Proceedings	9
Guides - Classroom - Learner	9
Legal/Legislative/Regulatory…	9
Collected Works - Serials	7
Guides - General	6
Reference Materials -…	6
Dissertations/Theses -…	4
Historical Materials	4
Reports - General	4
More ▼

National Assessment of…	27
Texas Assessment of Academic…	9
Test of English as a Foreign…	7
SAT (College Admission Test)	6
Graduate Record Examinations	5
Texas Essential Knowledge and…	5
Wechsler Intelligence Scale…	4
Armed Services Vocational…	3
College Level Academic Skills…	3
International English…	3
Stanford Binet Intelligence…	3
Wechsler Adult Intelligence…	3
ACT Assessment	2
Advanced Placement…	2
Behavior Assessment System…	2
Bender Gestalt Test	2
Computer Attitude Scale	2
Conflict Tactics Scale	2
General Educational…	2
Group Embedded Figures Test	2
Human Figure Drawing Test	2
Iowa Tests of Basic Skills	2
Myers Briggs Type Indicator	2
National Teacher Examinations	2
North Carolina End of Course…	2
More ▼