Publication Date
| In 2026 | 0 |
| Since 2025 | 2142 |
| Since 2022 (last 5 years) | 12652 |
| Since 2017 (last 10 years) | 33777 |
| Since 2007 (last 20 years) | 68268 |
Descriptor
| Foreign Countries | 30502 |
| Test Validity | 21718 |
| Scores | 18245 |
| Academic Achievement | 16904 |
| Test Construction | 16724 |
| Test Reliability | 15006 |
| Achievement Tests | 14836 |
| Standardized Tests | 14707 |
| Comparative Analysis | 14429 |
| Elementary Secondary Education | 13033 |
| Language Tests | 12545 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5033 |
| Teachers | 3390 |
| Researchers | 2630 |
| Policymakers | 1229 |
| Administrators | 976 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2813 |
| Australia | 2425 |
| Canada | 2269 |
| California | 1851 |
| United States | 1725 |
| Texas | 1613 |
| China | 1577 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1202 |
| Germany | 1120 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023
Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…
Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)
Luan, Lin; Liang, Jyh-Chong; Chai, Ching Sing; Lin, Tzu-Bin; Dong, Yan – Interactive Learning Environments, 2023
The emergence of new media technologies has empowered individuals to not merely consume but also create, share and critique media contents. Such activities are dependent on new media literacy (NML) necessary for living and working in the participatory culture of the twenty-first century. Although a burgeoning body of research has focused on the…
Descriptors: Foreign Countries, Media Literacy, Test Construction, English (Second Language)
Babic Cikeš, Ana; Cakic, Lara; Kuti, Vedrana – European Journal of Developmental Psychology, 2023
The changed sentence: The aim of this study was to investigate the reliability and validity of the Emotion Matching Task (EMT) in a sample of Croatian preschool children. The Croatian version of the EMT was applied to 198 children (52% female), together with measures of verbal ability and social competence. The internal structure of the test, as…
Descriptors: Foreign Countries, Objective Tests, Psychological Patterns, Preschool Children
Er, Zübeyde; Dinç Artut, Perihan; Bal, Ayten Pinar – Pegem Journal of Education and Instruction, 2023
This research aims to develop a reliable and valid scale to determine middle school students' self-efficacy about estimation skills. In addition, with the developed scale, the estimation skill self-efficacy of middle school students was examined in terms of various variables. For these purposes, a draft scale of 40 items was developed by reviewing…
Descriptors: Test Construction, Self Efficacy, Measures (Individuals), Middle School Students
Coohey, Carol; Landsman, Miriam J.; Cummings, Stephen P. – Journal of Teaching in Social Work, 2023
Many students report increasing test anxiety in the months before taking the national social work licensure exam. We evaluate whether adding a test-anxiety-reduction module to an online exam preparation course reduces MSW students' test anxiety. A non-equivalent pretest-posttest control-group design was used to compare 42 students who did not…
Descriptors: Test Preparation, Test Anxiety, Licensing Examinations (Professions), Social Work
Örnek, Gizem Tabaru; Sönmez, Yasemin; Kan, Adnan – Shanlax International Journal of Education, 2023
The aim of this study is to develop a measurement tool that can determine the attitudes of social studies teachers and classroom teachers towards the use of current events in the social studies course. The scale consists of 23 five-point Likert-type items. The scale form consisting of 40 items prepared by the researchers was administered to a…
Descriptors: Test Construction, Current Events, Social Studies, Teacher Attitudes
Ronan, Darcy; Erdil, D. Cenk; Brylow, Dennis – ACM Transactions on Computing Education, 2023
Instrument development is an important step towards unlocking the analytical power of teacher attitudes and beliefs towards Computer Science (CS). Teacher dispositions have strong empirical and theoretical ties to teacher motivation, professional choices, and classroom practices. To determine consensus desirable attitudes and beliefs, we analyzed…
Descriptors: Teacher Attitudes, Computer Science, Test Construction, Test Validity
De León, Leticia; Corbeil, Rene; Corbeil, Maria Elena – Journal of Research on Technology in Education, 2023
K-12 educators' digital literacy skills have been designated as a national and state priority by education and accreditation agencies in Texas, resulting in the need for curriculum alignments to the 2017 ISTE Standards for Educators and implementation of a digital literacy evaluation. The purpose of this validation study was to develop a…
Descriptors: Test Construction, Test Validity, Teacher Education, Digital Literacy
Zou, Tongtong; Bolt, Daniel M. – Measurement: Interdisciplinary Research and Perspectives, 2023
Person misfit and person reliability indices in item response theory (IRT) can play an important role in evaluating the validity of a test or survey instrument at the respondent level. Prior empirical comparisons of these indices have been applied to binary item response data and suggest that the two types of indices return very similar results.…
Descriptors: Item Response Theory, Rating Scales, Response Style (Tests), Measurement
Zetterqvist, Ann; Bach, Frank – International Journal of Science Education, 2023
The past century has seen a debate on what characterises a scientifically literate citizen. Originally, scientific literacy implied that a citizen should know the products of science but has grown to incorporate processes of science and aspects of the nature of science. Studies on students' epistemic knowledge are rarer than ones on students'…
Descriptors: Epistemology, Scientific Literacy, Science Instruction, International Assessment
Wolf, Mikyung Kim; Bailey, Alison L.; Ballard, Laura – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2023
Alignment between standards and assessments is a fundamental component of the larger effort to ensure appropriate and fair use of standards-based assessments and achievement of successful standards-based reform. This article focuses on conceptual and technical issues as well as potential strategies in evaluating alignment between large-scale,…
Descriptors: Alignment (Education), Academic Standards, Standardized Tests, Language Tests
Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-"R"), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…
Descriptors: German, Portuguese, Spanish, Test Construction
Stephen Hackler; Emily Elliott; Mark Eichenlaub; Alison M. Sweeney – Physical Review Physics Education Research, 2023
The increasing and diversifying student enrollments in introductory physics courses make reliable, valid, and usable instruments for measuring student skills and gains ever more important. In introductory physics, in addition to teaching facts about mechanics, we also seek to teach our students the skills of "thinking like a physicist,"…
Descriptors: Physics, Science Instruction, Thinking Skills, Test Construction
Christoph M. Paulus; Eric Klop – European Journal of Educational Sciences, 2023
The Jefferson Scale of Empathy is one of the most commonly used scales in medical education to measure empathy. It is specific to the field of medical education and geared toward orienting medical students to physician empathy in patient care situations. The scale was transferred to the educational context in teacher education. In doing so, the…
Descriptors: Attitude Measures, Test Construction, Empathy, Teacher Education

Peer reviewed
Direct link
