Publication Date
| In 2026 | 0 |
| Since 2025 | 27 |
| Since 2022 (last 5 years) | 113 |
| Since 2017 (last 10 years) | 280 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Testing Problems | 4850 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Slez, Adam; O'Connell, Heather A.; Curtis, Katherine J. – Sociological Methods & Research, 2017
Areal data have been used to good effect in a wide range of sociological research. One of the most persistent problems associated with this type of data, however, is the need to combine data sets with incongruous boundaries. To help address this problem, we introduce a new method for identifying common geographies. We show that identifying common…
Descriptors: Data, Data Processing, Geographic Information Systems, Research Methodology
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
National Center for Education Statistics, 2013
Educators, parents, and the public depend on accurate, valid, reliable, and timely information about student academic performance. Testing irregularities--breaches of test security or improper administration of academic testing--undermine efforts to use those data to improve student achievement. Unfortunately, there have been high-profile and…
Descriptors: Testing, Best Practices, Testing Problems, Integrity
Banks, Kathleen – Practical Assessment, Research & Evaluation, 2015
This article introduces practitioners and researchers to the topic of missing data in the context of differential item functioning (DIF), reviews the current literature on the issue, discusses implications of the review, and offers suggestions for future research. A total of nine studies were reviewed. All of these studies determined what effect…
Descriptors: Test Bias, Data, Literature Reviews, Evaluation Research
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Bray, Mark; Kobakhidze, Magda Nutsa – Comparative Education Review, 2014
Expanding numbers of researchers are focusing on the scale and impact of private supplementary tutoring. Such tutoring is widely called shadow education, since much of its curriculum mimics that of regular schooling. Although shadow education has expanded significantly worldwide and is now recognized to have far-reaching significance, research…
Descriptors: Tutoring, Private Education, Educational Research, Measurement
Hoang, Ngoc Thi Huyen – Language Education & Assessment, 2019
As validity pertains to test use rather than the test itself, using a test for unintended purposes requires a new validation program using additional evidence from relevant sources. This small-scale study contributes to the validation of the use of originally academic language tests--the International English Language Testing System and the Test…
Descriptors: Language Tests, Immigrants, Immigration, Testing Problems
Klufa, Jindrich – Journal on Efficiency and Responsibility in Education and Science, 2016
The paper contains an analysis of the differences of number of points in the test in mathematics between test variants, which were used in the entrance examinations at the Faculty of Business Administration at University of Economics in Prague in 2015. The differences may arise due to the varying difficulty of variants for students, but also…
Descriptors: Foreign Countries, College Students, Business Administration Education, College Entrance Examinations
Venticinque, Danilo; Whitworth, Andrew – Journal of Media Literacy Education, 2018
This article discusses the outcomes of research into the media literacy aspects of ENEM ("Exame Nacional do Ensino Médio"), Brazil's unified university entrance exam, which contains a significant number of exam questions based on excerpts from newspaper articles, online news and other media sources. Through content analysis, these…
Descriptors: Foreign Countries, College Entrance Examinations, Media Literacy, Test Content
Sternberg, Robert J. – Phi Delta Kappan, 2017
IQs increased by about 30 points in the 20th century. Part of this increase may have been the result of increased standardized testing because testing improves the skills on which students are tested. But although these practices may increase general intelligence, they may impede the development of creativity and wisdom. As a result, our society…
Descriptors: Intelligence Quotient, Intelligence Differences, Academic Achievement, Creativity
Karagöl, Efecan – Journal of Language and Linguistic Studies, 2020
Turkish and Foreign Languages Research and Application Center (TÖMER) is one of the important institutions for learning Turkish as a foreign language. In these institutions, proficiency tests are applied at the end of each level. However, test applications in TÖMERs vary between each center as there is no shared program in teaching Turkish as a…
Descriptors: Language Tests, Turkish, Language Proficiency, Second Language Learning
Yu, Chongni – English Language Teaching, 2020
The reform of National College Entrance Examination in Zhejiang Province, China has aroused widespread attention since it was released in 2014. It is notable that new English writing test types were adopted in the English subtest. The continuation task and summary writing become a challenge as well as a promoter for English writing teaching and…
Descriptors: Testing Problems, English (Second Language), Second Language Learning, Second Language Instruction
Lazarín, Melissa – Center for American Progress, 2014
It appears that schools and families are at a crossroads when it comes to testing. High-quality assessments generate rich data and can provide valuable information about student progress to teachers and parents, support accountability, promote high expectations, and encourage equity for students of color and low-income students. But it is…
Descriptors: Testing, Testing Problems, Urban Schools, Suburban Schools
Moodie, Ian; Nam, Hyun-Jeong – Language Teaching, 2016
This article reviews recent studies on English language teaching (ELT) in South Korea, where a great deal of research has been produced in recent years in local journals. In this article we review 95 studies from a pool of some 1,200 published between 2009 and 2014 on English language teaching and learning, focusing on research within the public…
Descriptors: English (Second Language), Second Language Learning, Language Usage, Language Teachers
Stohlman, Trey – Journal of the Scholarship of Teaching and Learning, 2015
A good assessment plan combines many direct and indirect measures to validate the collected data. One often controversial assessment measure comes in the form of retention exams. Although assessment retention exams may come with faults, others advocate for their inclusion in program assessment. Objective-based tests may offer insight to…
Descriptors: Alternative Assessment, Retention (Psychology), Program Evaluation, Program Effectiveness

Peer reviewed
Direct link
