Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Sanjaya Mishra; Nitesh Kumar Jha; Kaushal Kumar Bhagat – Journal of Learning for Development, 2025
The benchmarking toolkit for Technology-Enabled Learning (TEL) developed by the Commonwealth of Learning (COL) is designed to assess TEL practices in higher education institutions. This study evaluated the content validity, internal consistency, and inter-domain relationships of the toolkit using a survey of 355 practitioners across 21…
Descriptors: Benchmarking, Technology Uses in Education, Higher Education, Educational Practices
Ian Jones; Ben Davies – International Journal of Research & Method in Education, 2024
Educational researchers often need to construct precise and reliable measurement scales of complex and varied representations such as participants' written work, videoed lesson segments and policy documents. Developing such scales using can be resource-intensive and time-consuming, and the outcomes are not always reliable. Here we present…
Descriptors: Educational Research, Comparative Analysis, Educational Researchers, Measurement
Manjula Wijewickrema – portal: Libraries and the Academy, 2024
This research compares the performance measures reported by two bibliographic databases relevant to a set of authors who have published in predatory journals. The reliability of decision-making based on the information provided by uncontrolled bibliographic databases is examined to support rational decisions. A sample of authors who published in…
Descriptors: Periodicals, Ethics, Deception, Authors
Stefan K. Schauber; Anne O. Olsen; Erik L. Werner; Morten Magelssen – Advances in Health Sciences Education, 2024
Introduction: Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that…
Descriptors: Medical Students, Performance Based Assessment, Expertise, Interrater Reliability
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods
Sermin Metin; Mehmet Basaran; Merve Yildirim Seheryeli; Emily Relkin; Damla Kalyenci – Journal of Science Education and Technology, 2024
In the early years, it has become essential to support the acquisition of computational thinking, which is seen as a 21st-century skill and new literacy. A valid and reliable measurement tool is needed to develop and evaluate educational practices related to these skills. "TechCheck" is a validated unplugged assessment of computational…
Descriptors: Computation, Thinking Skills, Test Validity, Test Reliability
Farahiyah Wan Yunus; Sakinah Idris; Siti Noraini Asmuri; Bess Fowler; Muhammad Hibatullah Romli – American Journal of Play, 2024
The authors contend that children benefit from play as a form of intervention and as a means of fostering their cognitive, social, and physical growth. They review several standardized instruments developed over the last fifty years to assess this benefit of play on child development. They identify twenty-one such play measures, the majority of…
Descriptors: Child Development, Play, Test Reliability, Standardized Tests
Harry May; Travis Atkison – Journal of Cybersecurity Education, Research and Practice, 2024
Detecting and mitigating wormhole attacks in wireless networks remains a critical challenge due to their deceptive nature and potential to compromise network integrity. This paper proposes a novel approach to wormhole detection by leveraging propagation delay analysis between network nodes. Unlike traditional methods that rely on signature-based…
Descriptors: Computer Security, Identification, Computer Networks, Telecommunications
Tenko Raykov; George A. Marcoulides; Natalja Menold – Applied Measurement in Education, 2024
We discuss an application of Bayesian factor analysis for estimation of the optimal linear combination and associated maximal reliability of a multi-component measuring instrument. The described procedure yields point and credibility interval estimates of this reliability coefficient, which are readily obtained in educational and behavioral…
Descriptors: Bayesian Statistics, Test Reliability, Error of Measurement, Measurement Equipment
Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022
The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…
Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility
Unal, Zafer – Journal of Interactive Learning Research, 2022
Despite over fifteen years of flipped classroom implementation, current literature does not provide any reliable, standardized rubric as a guideline to create or evaluate flipped classroom lessons based on effective flipped classroom design principles. In fact, at the time of this study, when an internet search for existing rubrics was conducted,…
Descriptors: Flipped Classroom, Lesson Plans, Scoring Rubrics, Graduate Students
Funda Ugurlu; Filiz Evran Acar – Journal of Pedagogical Research, 2025
The aim of this study is to develop a valid and reliable measurement tool to identify teachers' tendencies towards professional development models. In line with the purpose of scale development, a survey model was preferred. The scale was designed to be applicable to teachers from various disciplines currently working in any institution…
Descriptors: Measures (Individuals), Test Reliability, Test Validity, Faculty Development
Snejana Slantcheva-Durst – Discover Education, 2025
This study assesses the civic orientation of graduate students via the "Importance of Social Action Engagement" scale and tests that instrument's construct validity and reliability when applied to graduate students. The study contributes to our understanding of the levels of graduate students' willingness to engage with social issues,…
Descriptors: Graduate Students, Civics, Citizenship Education, Measures (Individuals)
Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025
Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use for GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…
Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship
Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025
Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…
Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods

Peer reviewed
Direct link
