Lúcio, Patrícia Silva; Vandekerckhove, Joachim; Polanczyk, Guilherme V.; Cogo-Moreira, Hugo – Journal of Psychoeducational Assessment, 2021
The present study compares the fit of two- and three-parameter logistic (2PL and 3PL) models of item response theory in the performance of preschool children on the Raven's Colored Progressive Matrices. The test of Raven is widely used for evaluating nonverbal intelligence of factor g. Studies comparing models with real data are scarce on the…
Descriptors: Guessing (Tests), Item Response Theory, Test Validity, Preschool Children
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
It has been argued in the literature on (language) testing that any act of testing/assessment can impact: (1) educators' curriculum design; (2) teachers' teaching practices; and (3) students' learning behaviors. This quality of any given testing situation or act of assessment has been called washback, or backwash if you will. Washback falls into…
Descriptors: Testing Problems, Language Tests, Second Language Learning, Second Language Instruction
Thapelo Ncube Whitfield – ProQuest LLC, 2021
Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…
Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes
Clark McKown; Nicole Russo-Ponsaran; Ashley Karls – Society for Research on Educational Effectiveness, 2021
Social and Emotional Learning and Its Measurement: The ability to understand and effectively interact with others is a critical determinant of academic, social, and life success (DiPerna & Elliott, 2002). This fact is increasingly recognized in educational policy and practice. For example, an influential report by the National Academy of…
Descriptors: Computer Assisted Testing, Social Emotional Learning, Web Sites, Interpersonal Competence
Wartono; Batlolona, John Rafafy; Sutopo; Rahmatina, Desella Inna – Journal of Education and Learning (EduLearn), 2019
The purpose of this research is to develop test questions of problem solving ability on work-energy material for high school students class X. This type of research is research and development. The model used in this study is ADDIE with the stages of analyzing, planning, developing, implementing, and evaluating, but this study only up to the…
Descriptors: Problem Solving, Test Construction, High School Students, Test Items
Brown, Ted – Journal of Occupational Therapy, Schools & Early Intervention, 2019
The Bruininks-Oseretsky Test of Motor Proficiency -- Second Edition (BOT-2) is a commonly used assessment of children's skills. It is important that assessments have validity evidence reported about them. The objective of the study was to investigate the structural validity of the BOT-2's eight subscales and four composite scales. A sample of 117…
Descriptors: Performance Tests, Psychomotor Skills, Test Validity, Children
Xiao, Yang; Koenig, Kathleen; Han, Jing; Liu, Jing; Liu, Qiaoyi; Bao, Lei – Physical Review Physics Education Research, 2019
Standardized concept inventories (CIs) have been widely used in science, technology, engineering, and mathematics education for assessment of student learning. In practice, there have been concerns regarding the length of the test and possible test-retest memory effect. To address these issues, a recent study developed a method to split a CI into…
Descriptors: Scientific Concepts, Science Tests, Energy, Magnets
Lee, Elizabeth – Studies in Applied Linguistics & TESOL, 2020
Ensuring that test-score use brings about socially positive consequences for test-takers is an important aspect of test validation. While many studies use an inductive approach to evaluate test consequences, few studies have implemented Appraisal analysis. To that end, this case study investigated the test consequences of an English reading…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Reading Tests
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Elturki, Eman – English Teaching Forum, 2020
Accrediting agencies for English language programs, such as the Commission on English Language Program Accreditation (CEA), require a plan in writing for monitoring and reviewing assessment practices. Nonetheless, web-search queries such as "assessing assessment," "how to assess assessment," "assessing assessment…
Descriptors: College Second Language Programs, English (Second Language), Student Evaluation, Test Reliability
ShayesteFar, Parvaneh – Educational Assessment, Evaluation and Accountability, 2020
Research on test change often documents high-stakes English test impact on English language learning, whereas evidence for simultaneous impact on affective predictors of learning is still missing. We tested a theoretical model positing that changing high-stake English tests (English Language Requirements for University Entrance, in this study)…
Descriptors: Language Tests, High Stakes Tests, English Language Learners, Student Attitudes
Turhan, Nihan Sölpük – International Journal of Progressive Education, 2020
Measurement tools that are used in education are important factors that affect course success and motivation of students. This study aims to determine the opinions of high school students on different question types. As the subgoals of the research, the study aims to determine the reasons for multiple choice test preference and its effect on…
Descriptors: Test Items, Preferences, High School Students, Learning Motivation
Priyatni, Endah Tri; Martutik – SAGE Open, 2020
The ability to think critically and creatively is essential for students to help them thrive in the 21st century. Creative and critical thinking can be measured through problem solving because the assessment contains tasks that require students to find problems, analyze and evaluate problems, and work out the solutions. Therefore, this study was…
Descriptors: Problem Solving, Reading Tests, Test Construction, Test Validity
Gasteiger, Hedwig; Bruns, Julia; Benz, Christiane; Brunner, Esther; Sprenger, Priska – ZDM: The International Journal on Mathematics Education, 2020
Measurement instruments of early childhood teachers' mathematical pedagogical content knowledge (MPCK) have to consider the special characteristics of early childhood teaching. Early childhood teaching includes some planned activities but in contrast to learning in school, it is often motivated and generated by situations which unfold…
Descriptors: Mathematics Instruction, Pedagogical Content Knowledge, Multiple Choice Tests, Kindergarten
Lim, Lyndon – Journal of Psychoeducational Assessment, 2020
This article outlines the development and validation of the Computer-Delivered Test (CDT) Acceptance Questionnaire (CTAQ). The CTAQ was designed to be a practical measure of CDT acceptance of Singapore secondary and high school students (Grades 7-12) toward taking tests within an e-assessment system. The stages of test (questionnaire item)…
Descriptors: Student Attitudes, High School Students, Secondary School Students, Computer Assisted Testing

