Publication Date
In 2025 | 4 |
Since 2024 | 12 |
Descriptor
Test Construction | 12 |
Test Use | 12 |
Accuracy | 4 |
Test Validity | 4 |
Decision Making | 3 |
Elementary School Students | 3 |
Foreign Countries | 3 |
Language Tests | 3 |
Psychometrics | 3 |
Scores | 3 |
Test Interpretation | 3 |
More ▼ |
Source
Author
Amy Briesch | 2 |
Brittany Melo | 2 |
Jacqueline M. Caemmerer | 2 |
Jessica B. Koslouski | 2 |
Sandra M. Chafouleas | 2 |
Albert Weideman | 1 |
Amery D. Wu | 1 |
Amit Sevak | 1 |
Andrew P. Jaciw | 1 |
Bart Deygers | 1 |
Boyu Wang | 1 |
More ▼ |
Publication Type
Journal Articles | 11 |
Reports - Research | 7 |
Reports - Evaluative | 3 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 3 |
Higher Education | 3 |
Postsecondary Education | 3 |
Early Childhood Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
Primary Education | 1 |
More ▼ |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024
In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…
Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Manxia Dong; Boyu Wang – Language Testing in Asia, 2025
This study aimed to explore the relationship between students' understanding of the National Matriculation English Test (NMET) and their learning practices through standard multiple regression (SMR) and structural equation modeling (SEM) with the purpose of unraveling the working mechanism of washback. A total number of 3105 Chinese senior high…
Descriptors: Foreign Countries, High School Seniors, Test Construction, Test Use
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024
Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…
Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software
Lovisa Alehagen; Sven Bölte; Melissa H Black – Autism: The International Journal of Research and Practice, 2025
The International Classification of Functioning, Disability, and Health is a biopsychosocial framework of health-related functioning designed to provide a unifying system for health care, social services, education, and policy sectors. Since its publication in 2001, the International Classification of Functioning has been used to guide clinical…
Descriptors: Autism Spectrum Disorders, Attention Deficit Hyperactivity Disorder, Classification, Functional Behavioral Assessment
Ying Wu; Rita Elaine Silver; Guangwei Hu – Journal of Multilingual and Multicultural Development, 2024
The Zhuang language test ("Vahcuengh Sawcuengh Suijbingz Gaujsi", VSSG) is the first minority language test in the People's Republic of China. It was designed with multiple goals including improving Zhuang language teaching, recruiting students for relevant majors of tertiary study, identifying proficiency for work-related applications,…
Descriptors: Language Minorities, Language Tests, Second Language Learning, Second Language Instruction
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Psychometrics, Validity, Child Development
Gopal Prasad Pandey – Journal of Practical Studies in Education, 2024
This paper explores the role of language testing in English education, focusing on its theoretical foundations, methodologies and practical applications. It analyzes how language tests fulfill various purposes, such as placement, progress monitoring, achievement evaluation and diagnostic feedback, underlining the importance of a critical…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction
Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024
We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…
Descriptors: Screening Tests, Usability, Decision Making, Validity
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction