Publication Date
| In 2026 | 0 |
| Since 2025 | 2142 |
| Since 2022 (last 5 years) | 12652 |
| Since 2017 (last 10 years) | 33777 |
| Since 2007 (last 20 years) | 68268 |
Descriptor
| Foreign Countries | 30502 |
| Test Validity | 21718 |
| Scores | 18245 |
| Academic Achievement | 16904 |
| Test Construction | 16724 |
| Test Reliability | 15006 |
| Achievement Tests | 14836 |
| Standardized Tests | 14707 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13033 |
| Language Tests | 12545 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5033 |
| Teachers | 3390 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2813 |
| Australia | 2425 |
| Canada | 2269 |
| California | 1851 |
| United States | 1725 |
| Texas | 1613 |
| China | 1577 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1120 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Vahid Aryadoust – Applied Linguistics, 2024
I analyzed a corpus of the international English language testing system (IELTS) comprising 256 listening sections (1996-2021). The primary objective of the study was to gain insights into the assumptions made by test designers regarding the real-life contexts that test-takers will encounter. Overall, 15 superordinate topic areas and 300 subtopics…
Descriptors: Dialects, Pronunciation, Commercialization, Second Language Learning
Aaron Soo Ping Chow; Amanda Nabors – Society for Research on Educational Effectiveness, 2024
Background: Early-grade reading proficiency is well documented as in important factor for later academic success. Researchers and educators consider the achievement of reading proficiency by the end of grade three to be crucial to future academic success and financial independence (Hein et al., 2013; La Paro & Pianta, 2000; Singh, 2013). As…
Descriptors: Elementary School Students, Primary Education, Grade 3, Emergent Literacy
William Marshall Harvey – ProQuest LLC, 2024
This study examined whether there was a significant difference between the End-of-Course Examination scores (EOC) across multiple academic disciplines between students who participated in sports and those who did not participate in athletics at all in three rural high schools in South Carolina. The theoretical foundation for this study was based…
Descriptors: Tests, Student Evaluation, Student Athletes, Scores
Shelby J. Haberman; Sabine Meinck; Ann-Kristin Koop – Large-scale Assessments in Education, 2024
This paper extends existing work on teacher weighting in student-centered surveys by looking into aspects of practical implementation of deriving and using weights for teacher-centered analysis in the Trends in International Mathematics and Science Study (TIMSS) and the Progress in International Reading Literacy Study (PIRLS). The formal…
Descriptors: Elementary Secondary Education, Foreign Countries, Achievement Tests, Mathematics Achievement
Gong, Kaixuan – Asian-Pacific Journal of Second and Foreign Language Education, 2023
The extensive use of automated speech scoring in large-scale speaking assessment can be revolutionary not only to test design and rating, but also to the learning and instruction of speaking based on how students and teachers perceive and react to this technology. However, its washback remained underexplored. This mixed-method study aimed to…
Descriptors: Second Language Learning, Language Tests, English (Second Language), Automation
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Kohen, Zehavit; Gharra-Badran, Yasmin – Teaching Mathematics and Its Applications, 2023
Mathematics modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modeling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…
Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity
Albert Weideman; Tobie van Dyk – Language Teaching Research Quarterly, 2023
This contribution investigates gains in technical economy in measuring language ability by considering one recurrent interest of JD Brown: cloze tests. In the various versions of the Test of Academic Literacy Levels (TALL), its Sesotho and Afrikaans (Toets van Akademiese Geletterdheidsvlakke -- TAG) counterparts, as well as related other tests…
Descriptors: Language Skills, Language Aptitude, Cloze Procedure, Reading Tests
Deniz Arslan; Ömer Faruk Tamul; Murat Dogan Sahin; Ugur Sak – Journal of Pedagogical Research, 2023
An examination of gender-related differential item functioning was conducted on the verbal subtests of the Anadolu-Sak Intelligence Scale. Analyses were conducted using the scale standardization data (N = 4641). A Mantel-Haenszel statistic was used to detect differential item functioning (DIF). A total of 58 verbal analogical reasoning items, 20…
Descriptors: Foreign Countries, Intelligence Tests, Gender Bias, Gender Differences
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
NWEA, 2022
This technical report documents the processes and procedures employed by NWEA® to build and support the English MAP® Reading Fluency™ assessments administered during the 2020-2021 school year. It is written for measurement professionals and administrators to help evaluate the quality of MAP Reading Fluency. The seven sections of this report: (1)…
Descriptors: Achievement Tests, Reading Tests, Reading Achievement, Reading Fluency
Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022
We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…
Descriptors: Science Tests, Test Validity, Test Items, Test Construction
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
Pastor, Dena A.; Patterson, Chris R.; Finney, Sara J. – Journal of Psychoeducational Assessment, 2023
In low-stakes testing contexts, there are minimal personal consequences associated with examinee performance. Examples include assessments administered for research, program evaluation, test development, and international comparisons (e.g., Programme for International Student Assessment [PISA]). Because test-taking motivation can suffer in…
Descriptors: Test Construction, Test Validity, Student Attitudes, Attitude Measures
Kalkbrenner, Michael T.; Ryan, Aimee F.; Hunt, Adam J.; Rahman, Samiah R. – Measurement and Evaluation in Counseling and Development, 2023
We conducted a psychometric synthesis of the internal consistency reliability and internal structure of scores on the English versions of the Patient Health Questionnaire-9 (PHQ-9) and Generalized Anxiety Disorder-7 (GAD-7) in publications between 2012 and 2022. Results supported acceptable-to-strong reliability and validity evidence of scores…
Descriptors: Psychometrics, Test Validity, Test Reliability, Questionnaires

Peer reviewed
Direct link
