ERIC - Search Results

Showing all 12 results

Language Testers and Their Place in the Policy Web

Peer reviewed

Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024

In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…

Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy

Improving Balance in Educational Measurement: A Legacy of E. F. Lindquist

Peer reviewed

Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024

A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…

Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History

Hold the Bets! Should Quasi-Experiments Be Preferred to True Experiments When Causal Generalization Is the Goal?

Peer reviewed

Andrew P. Jaciw – American Journal of Evaluation, 2025

By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…

Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias

Does Students' Understanding of Test Designs and Demands of a High-Stakes Test Influence Their Learning Practices: A Washback Study

Peer reviewed

Manxia Dong; Boyu Wang – Language Testing in Asia, 2025

This study aimed to explore the relationship between students' understanding of the National Matriculation English Test (NMET) and their learning practices through standard multiple regression (SMR) and structural equation modeling (SEM) with the purpose of unraveling the working mechanism of washback. A total number of 3105 Chinese senior high…

Descriptors: Foreign Countries, High School Seniors, Test Construction, Test Use

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Application of the International Classification of Functioning, Disability, and Health in Autism and Attention-Deficit Hyperactivity Disorder: A Scoping Review

Peer reviewed

Lovisa Alehagen; Sven Bölte; Melissa H Black – Autism: The International Journal of Research and Practice, 2025

The International Classification of Functioning, Disability, and Health is a biopsychosocial framework of health-related functioning designed to provide a unifying system for health care, social services, education, and policy sectors. Since its publication in 2001, the International Classification of Functioning has been used to guide clinical…

Descriptors: Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Classification, Functional Behavioral Assessment

Minority Language Testing: The Social Impact of the Zhuang Language Proficiency Test in China

Peer reviewed

Ying Wu; Rita Elaine Silver; Guangwei Hu – Journal of Multilingual and Multicultural Development, 2024

The Zhuang language test ("Vahcuengh Sawcuengh Suijbingz Gaujsi", VSSG) is the first minority language test in the People's Republic of China. It was designed with multiple goals including improving Zhuang language teaching, recruiting students for relevant majors of tertiary study, identifying proficiency for work-related applications,…

Descriptors: Language Minorities, Language Tests, Second Language Learning, Second Language Instruction

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development

Language Testing Reimagined: Enhancing Teaching and Learning in English Education

Peer reviewed

Gopal Prasad Pandey – Journal of Practical Studies in Education, 2024

This paper explores the role of language testing in English education, focusing on its theoretical foundations, methodologies and practical applications. It analyzes how language tests fulfill various purposes, such as placement, progress monitoring, achievement evaluation and diagnostic feedback, underlining the importance of a critical…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024

Descriptors: Screening Tests, Usability, Decision Making, Validity

Charting the Future of Assessments. Research Report. ETS RR-24-13

Peer reviewed

Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
