Publication Date
In 2025 | 6 |
Since 2024 | 15 |
Since 2021 (last 5 years) | 53 |
Since 2016 (last 10 years) | 152 |
Since 2006 (last 20 years) | 272 |
Descriptor
Test Reliability | 560 |
Test Validity | 560 |
Scoring | 403 |
Test Construction | 198 |
Testing | 113 |
Scoring Rubrics | 97 |
Psychometrics | 80 |
Test Items | 79 |
Test Interpretation | 73 |
Foreign Countries | 69 |
Scoring Formulas | 66 |
Practitioners | 26 |
Researchers | 12 |
Administrators | 9 |
Teachers | 9 |
Policymakers | 6 |
Students | 3 |
Counselors | 1 |
Parents | 1 |
Location
New York | 13 |
Turkey | 10 |
Canada | 8 |
Nebraska | 8 |
Australia | 6 |
Florida | 6 |
Pennsylvania | 6 |
California | 5 |
United Kingdom | 5 |
United States | 4 |
Idaho | 3 |
Laws, Policies, & Programs
Individuals with Disabilities… | 5 |
No Child Left Behind Act 2001 | 3 |
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
National Institute for Excellence in Teaching, 2023
Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…
Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024
The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…
Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity
Reuben S. Asempapa; Doris Lee – Discover Education, 2025
Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…
Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles published between 2000 and 2020 that employed the Turkish form of the "Test Anxiety Inventory (TAI)," one of the tests frequently used in the field of education, were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Wagaman, John; Fletcher, Michael – Teaching Statistics: An International Journal for Teachers, 2018
This article considers how a handicapping system should be devised for squash. It looks at the American scoring system, and whether it is possible to have a fair system of handicapping. We consider "fair" from a perspective of expected number of rallies won and probability of winning.
Descriptors: Probability, Athletes, Athletics, Inhibition
Kohen, Zehavit; Gharra-Badran, Yasmin – Teaching Mathematics and Its Applications, 2023
Mathematical modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modelling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…
Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity
Flor de Lis González-Mujico – Education and Information Technologies, 2024
Over the past decade, self-assessment tools have garnered significant attention in the interest of measuring the skillset required by educators and students to function productively and ethically in digitally mediated environments, particularly in relation to education policy implementation. Since stated beliefs do not always align with actual…
Descriptors: Technological Literacy, Evaluation Methods, Test Validity, Test Construction
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023
When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…
Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics
Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024
Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…
Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork
Maria Blevins; Bryce Hughes; Jennifer Green; Leila Sterman; Shannon Willoughby – Journal of College Science Teaching, 2025
In this work, the authors document an expansion of the Public Speaking Competency Rubric (PSCR). First developed in 2012 by Schreiber et al., the original rubric has only one item related to non-verbal communication. The authors of this work expanded the rubric to include 10 items related to the non-verbal aspects of public speaking and had it…
Descriptors: Test Construction, Public Speaking, Competence, Scoring Rubrics