Publication Date
In 2025 | 1 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 15 |
Since 2016 (last 10 years) | 48 |
Since 2006 (last 20 years) | 179 |
Descriptor
Reliability | 297 |
Validity | 271 |
Evaluation Methods | 67 |
Measures (Individuals) | 46 |
Higher Education | 41 |
Student Evaluation | 37 |
Models | 36 |
Foreign Countries | 34 |
Research Methodology | 32 |
Test Construction | 31 |
Accountability | 27 |
More ▼ |
Source
Author
Herman, Joan L. | 4 |
Raykov, Tenko | 4 |
Lane, Kathleen Lynne | 3 |
Oakes, Wendy Peia | 3 |
Bastick, Tony | 2 |
Darling-Hammond, Linda | 2 |
Dietel, Ronald | 2 |
Goldschmidt, Pete | 2 |
Haney, Walt | 2 |
Harlen, Wynne | 2 |
Heritage, Margaret | 2 |
More ▼ |
Publication Type
Education Level
Location
Australia | 8 |
Florida | 5 |
United Kingdom (England) | 5 |
United States | 5 |
New York | 4 |
California | 3 |
Canada | 3 |
India | 3 |
Maryland | 3 |
New Zealand | 3 |
United Kingdom | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Anthony S. Bryk; Angel Yee-Lam Li; Stuart Luppescu; Mai Anh Bui – Peabody Journal of Education, 2025
This is the second article in a series of three in this special issue on establishing a boundary object to foster network health and development. The first article laid out the theoretical rationale for an Improvement Network Health and Development Framework. This article details the efforts to develop a set of practical measures tied to this…
Descriptors: Validity, Networks, Measurement Techniques, Reliability
Ian Jones; Ben Davies – International Journal of Research & Method in Education, 2024
Educational researchers often need to construct precise and reliable measurement scales of complex and varied representations such as participants' written work, videoed lesson segments and policy documents. Developing such scales using can be resource-intensive and time-consuming, and the outcomes are not always reliable. Here we present…
Descriptors: Educational Research, Comparative Analysis, Educational Researchers, Measurement
Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…
Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software
Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024
The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…
Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Nghia, Tran Le Huu; Duyen, Nguyen Thi My – Higher Education: The International Journal of Higher Education Research, 2019
Developed as an integral component of many higher education programs, internships provide a multitude of benefits for participating students. However, there is a lack of tools designed to measure internship-related learning outcomes. Therefore, this article will present the process of constructing and validating a scale that can be used to…
Descriptors: Higher Education, Internship Programs, Construct Validity, Outcomes of Education
Quintão, Cátia; Andrade, Pedro; Almeida, Fernando – Journal of Interdisciplinary Studies in Education, 2020
The case study is a widely used method in qualitative research. Although defining the case study can be simple, it is complex to develop its strategy. Furthermore, it is still often not considered to be a sufficiently robust research strategy in the education field because it does not offer well-defined and use well-structured protocols. One of…
Descriptors: Case Studies, Research Methodology, Validity, Reliability
Price, Heather E.; Smith, Christian – Field Methods, 2021
To identify the dominant cultural models among parents transmitting faith to their children, we find few methodological guidelines to guide coding and analysis of semi-structured interviews. We thus developed a three-phase procedure for our research team. Phase-one follows Campbell et al. by unitizing on meanings rather than words/pages, including…
Descriptors: Semi Structured Interviews, Parents, Religion, Reliability
Evans, Carol; Kandiko Howson, Camille; Forsythe, Alex; Edwards, Corony – Assessment & Evaluation in Higher Education, 2021
Over the last 20 years there has been significant growth in the volume of higher education pedagogical research across disciplines and national contexts, but inherent tensions in defining quality remain. In this paper we present a framework to support understanding of what constitutes internationally excellent research, drawing on a range of…
Descriptors: Educational Quality, Higher Education, Educational Research, Scholarship
Howard, Jeffrey N. – Practical Assessment, Research & Evaluation, 2022
The Student Evaluation of Teaching (SET) instrument provides insight for instructors and administrators alike, often touting high response-rates to endorse their validity and reliability. However, response-rate alone omits consideration for "adequate quantity of 'observational sampling opportunity' (OSO) data points" (e.g., high student…
Descriptors: Student Evaluation of Teacher Performance, Validity, Reliability, Longitudinal Studies
Borbély-Pecze, Tibor Bors – British Journal of Guidance & Counselling, 2020
An overview of the evolution of career information in light of the changing nature of the world of work is presented. Owing to the constant fundamental changes in the labour market, the distribution of paid work has been also constantly changing. In this article, a more dynamic and -- often temporary -- interplay between citizens and their…
Descriptors: Career Development, Information Sources, Validity, Reliability
ElJishi, Ziad; Abdel-Hameed, Faten S. M. – International Journal of Education and Literacy Studies, 2022
This concept paper highlights the problem of the lack of a unified Arab list of Bloom's taxonomy to be used in teacher-preparation programs across Arab universities. The paper illustrates the current problem and offers steps needed for completing a project that would produce a unified list. The unified list would have both the required validity…
Descriptors: Arabs, Teacher Education Programs, Validity, Reliability
Wise, Steven L. – Applied Measurement in Education, 2019
The identification of rapid guessing is important to promote the validity of achievement test scores, particularly with low-stakes tests. Effective methods for identifying rapid guesses require reliable threshold methods that are also aligned with test taker behavior. Although several common threshold methods are based on rapid guessing response…
Descriptors: Guessing (Tests), Identification, Reaction Time, Reliability
Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…
Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes
Becerra, Beatriz; Núñez, Paola; Vergara, Claudia; Santibáñez, David; Krüger, Dirk; Cofré, Hernán – Research in Science Education, 2023
Despite the importance of evolution to understand biology, there is significant evidence that many biology teachers have difficulties to successfully teach this topic. The purpose of this study is to describe procedures by which a paper-and-pencil instrument to assess teachers' pedagogical content knowledge for evolution (PCK[subscript evo]) was…
Descriptors: Evolution, Science Instruction, Pedagogical Content Knowledge, Construct Validity