Publication Date
In 2025 | 3 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 22 |
Since 2006 (last 20 years) | 40 |
Descriptor
Test Reliability | 325 |
Test Validity | 325 |
Testing Problems | 325 |
Test Construction | 100 |
Standardized Tests | 67 |
Elementary Secondary Education | 61 |
Test Bias | 55 |
Achievement Tests | 50 |
Test Interpretation | 50 |
Student Evaluation | 44 |
Higher Education | 42 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 11 |
Postsecondary Education | 8 |
Secondary Education | 5 |
Elementary Secondary Education | 4 |
Early Childhood Education | 1 |
Elementary Education | 1 |
High Schools | 1 |
Preschool Education | 1 |
Audience
Practitioners | 17 |
Researchers | 10 |
Teachers | 4 |
Counselors | 2 |
Administrators | 1 |
Parents | 1 |
Policymakers | 1 |
Students | 1 |
Support Staff | 1 |
Location
Canada | 5 |
Australia | 4 |
China | 4 |
Illinois | 3 |
United Kingdom | 3 |
United States | 3 |
California | 2 |
Israel | 2 |
Brazil | 1 |
California (Stanford) | 1 |
Colorado (Denver) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Zita Lysaght; Michael O'Leary; Angela Mazzone; Conor Scully – Sage Research Methods Cases, 2022
Since 2018, colleagues from two research centers at Dublin City University have been collaborating to develop a measurement scale to assess individuals' ability to identify workplace bullying. Having agreed on an operational definition of the construct, an item pool of 26 workplace bullying scenarios, that is, short descriptions of…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Mengna Zheng; Chengwu Ruan – South African Journal of Education, 2024
Comprehensive quality assessment is an assessment system that identifies and explores students' strengths. By examining the developmental progress made in pilot provinces that have implemented comprehensive quality assessment, valuable insights and guidance can be derived for other provinces preparing to adopt this assessment approach. In this…
Descriptors: Foreign Countries, High School Students, College Entrance Examinations, Pilot Projects
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
McGill, Ryan J.; Ward, Thomas J.; Canivez, Gary L. – School Psychology International, 2020
The Wechsler Intelligence Scale for Children (WISC) is the most widely used intelligence test in the world. Now in its fifth edition, the WISC-V has been translated and adapted for use in nearly a dozen countries. Despite its popularity, numerous concerns have been raised about some of the procedures used to develop and validate translated and…
Descriptors: Children, Intelligence Tests, Translation, Test Validity
Kettler, Ryan J. – School Psychology International, 2020
This article is a commentary on McGill et al.'s (2020) article "Use of Translated and Adapted Versions of the WISC-V: Caveat Emptor." McGill et al. use caveat emptor in their title to indicate that the buyer of an assessment must be careful about the product being purchased, presumably because the seller of the assessment is not being…
Descriptors: Children, Intelligence Tests, Translation, Test Reliability
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50-years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Bao, Lei; Xiao, Yang; Koenig, Kathleen; Han, Jing – Physical Review Physics Education Research, 2018
In science, technology, engineering, and mathematics education there has been increased emphasis on teaching goals that include not only the learning of content knowledge but also the development of scientific reasoning skills. The Lawson classroom test of scientific reasoning (LCTSR) is a popular assessment instrument for scientific reasoning.…
Descriptors: Science Tests, Science Process Skills, Logical Thinking, Test Validity
Rear, David – Assessment & Evaluation in Higher Education, 2019
In today's market-driven educational culture, universities are coming under increasing pressure to justify funding through the disclosure of measurable outcomes in education and research. One educational objective that receives particular attention is critical thinking, regarded as an essential skill in both academic and work environments. The…
Descriptors: Critical Thinking, Standardized Tests, Outcomes of Education, Educational Objectives
Shraim, Khitam – Turkish Online Journal of Distance Education, 2019
Online examinations, commonly known as electronic examinations (e-exams), are becoming increasingly implemented in higher education institutions in Palestine. However, learners' perspectives on these exams remain unexplored. This study therefore examines learners' perceptions of the online examination practices at Palestine Technical…
Descriptors: Computer Assisted Testing, Higher Education, Foreign Countries, Undergraduate Students
Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016
Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…
Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation