Publication Date
| In 2026 | 10 |
| Since 2025 | 2328 |
| Since 2022 (last 5 years) | 12843 |
| Since 2017 (last 10 years) | 33968 |
| Since 2007 (last 20 years) | 68459 |
Descriptor
| Foreign Countries | 30579 |
| Test Validity | 21757 |
| Scores | 18263 |
| Academic Achievement | 16934 |
| Test Construction | 16763 |
| Test Reliability | 15036 |
| Achievement Tests | 14864 |
| Standardized Tests | 14724 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13046 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2823 |
| Australia | 2430 |
| Canada | 2270 |
| California | 1854 |
| United States | 1727 |
| Texas | 1615 |
| China | 1579 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1203 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Vuorre, Matti; Metcalfe, Janet – Metacognition and Learning, 2022
This article investigates the concern that assessment of metacognitive resolution (or relative accuracy--often evaluated by gamma correlations or signal detection theoretic measures such as d[subscript a]) is vulnerable to an artifact due to guessing that differentially impacts low as compared to high performers on tasks that involve…
Descriptors: Metacognition, Accuracy, Memory, Multiple Choice Tests
Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022
Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…
Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation
Yan, Zi; Pastore, Serafina – Journal of Psychoeducational Assessment, 2022
A significant challenge in studying formative assessment is the lack of suitable instruments for assessing teachers' formative assessment practices. This paper reports the development of the Teacher Formative Assessment Practice Scale (TFAPS) and its psychometric properties based on two samples of primary and secondary school teachers: one from…
Descriptors: Formative Evaluation, Educational Strategies, Foreign Countries, Elementary School Teachers
Williamson, Joanna – Research Matters, 2022
Providing evidence that can inform awarding is an important application of Comparative Judgement (CJ) methods in high-stakes qualifications. The process of marking scripts is not changed, but CJ methods can assist in the maintenance of standards from one series to another by informing decisions about where to place grade boundaries or cut scores.…
Descriptors: Standards, Grading, Decision Making, Comparative Analysis
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards
Araneda, Sergio; Lee, Dukjae; Lewis, Jennifer; Sireci, Stephen G.; Moon, Jung Aa; Lehman, Blair; Arslan, Burcu; Keehner, Madeleine – Education Sciences, 2022
Students exhibit many behaviors when responding to items on a computer-based test, but only some of these behaviors are relevant to estimating their proficiencies. In this study, we analyzed data from computer-based math achievement tests administered to elementary school students in grades 3 (ages 8-9) and 4 (ages 9-10). We investigated students'…
Descriptors: Student Behavior, Academic Achievement, Computer Assisted Testing, Mathematics Achievement
Runge, Timothy J. – Communique, 2022
Reading, writing, and mathematics are widely regarded as foundational academic skills upon which many other academic skills depend. Consequently, each receives a considerable allocation of resources for instruction, assessment, and intervention in K-12 education (Hooper, 2002). An additional indicator of the importance of these skills is the…
Descriptors: High School Students, Writing Skills, Written Language, Writing Evaluation
Springer, Mark Christopher; Tyran, Craig K. – Quality Assurance in Education: An International Perspective, 2022
Purpose: This study aims to describe the development and validation of a student survey instrument to assess academic advising services. The instrument was based on the SERVQUAL scale, a well-known instrument for service quality. Design/methodology/approach: A quantitative methodology was used. Data were collected through a structured…
Descriptors: Academic Advising, Quality Assurance, Student Surveys, Test Construction
Masterson, Jessica E. – Reading Research Quarterly, 2022
I detail findings from an ethnographic study of a high school remedial reading class, with a particular focus on students' perceptions of what it means to be literate and how their mandatory enrollment in the course impacted their identities. Compounding students' experiences was the existence of a high-stakes reading examination that all students…
Descriptors: High School Students, Remedial Reading, Literacy, Student Attitudes
Betts, Joe; Muntean, William; Kim, Doyoung; Kao, Shu-chuan – Educational and Psychological Measurement, 2022
The multiple response structure can underlie several different technology-enhanced item types. With the increased use of computer-based testing, multiple response items are becoming more common. This response type holds the potential for being scored polytomously for partial credit. However, there are several possible methods for computing raw…
Descriptors: Scoring, Test Items, Test Format, Raw Scores
Albritton, Kizzy; Stuckey, Adrienne; Patton Terry, Nicole – Journal of Early Intervention, 2022
Three-year-old children are seldom the focus in studies about supplemental early literacy instructional support. This study examines 3-year-old children's potential need for additional early literacy support, extending and replicating a previous investigation that identified prekindergarten children (i.e., 4-year-olds) in Head Start classrooms for…
Descriptors: Emergent Literacy, Preschool Children, Supplementary Education, Classification
Wästerlid, Catarina – International Journal of Early Years Education, 2022
This systematic review analyses the research results of low-achieving grade K-3 children's numeracy competencies by investigating the research approaches used, the definitions of low achievers and the numeracy competencies reported. 18 articles, identified in ERIC, PsycINFO and Web of Science, were selected for further analysis. The results show…
Descriptors: Low Achievement, Elementary School Students, Numeracy, Competence
Christen, Scott; Violanti, Michelle T.; Morrow, Jennifer – Online Learning, 2022
This study involved the creation and validation of a self-rated social presence measure. Study 1 utilized focus groups to create items. The focus group participants were presented with a set of items based upon past literature; through discussion of these items, a preliminary measure was created. Study 2 involved an exploratory factor analysis on…
Descriptors: Measures (Individuals), Self Evaluation (Individuals), Social Media, Computer Mediated Communication
Zhan, Ying – Assessment & Evaluation in Higher Education, 2022
Although the importance of investigating student feedback literacy has been widely argued in the literature, a measurement instrument is still lacking. In this study, a student feedback literacy scale was developed and validated. The scale consists of six dimensions (eliciting, processing, enacting, appreciation, readiness and commitment). Five…
Descriptors: Test Construction, Test Validity, Feedback (Response), Multiple Literacies
Vidal Rodeiro, Carmen; Macinska, Sylwia – Assessment in Education: Principles, Policy & Practice, 2022
There has been controversy around the practice of providing accommodations, with some suggesting that they may give an unfair advantage rather than level the playing field. If that were the case, the assessment results of students with accommodations could be inflated, leading to a detrimental effect on the assessment's validity. This research…
Descriptors: Testing Accommodations, High Stakes Tests, Scores, Student Characteristics

Peer reviewed
Direct link
