Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Parker, David C.; Stewart, Lisa H.; Thomson, Susan; Kaminski, Ruth A. – Assessment for Effective Intervention, 2021
Vocabulary skills are important for overall reading competence, but vocabulary assessment approaches that inform instructional decision-making and are sensitive to improvement are limited. This article describes a process for developing vocabulary measures designed to facilitate data-driven decision-making for kindergarten and first-grade students…
Descriptors: Vocabulary, Kindergarten, Grade 1, Elementary School Students
Michelle Herridge – ProQuest LLC, 2021
Evaluation of student written work during summative assessments is an important and critical task for instructors at all educational levels. Nevertheless, few research studies exist that provide insights into how different instructors approach this task. Chemistry faculty (FIs) and graduate student instructors (GSIs) regularly engage in the…
Descriptors: Science Instruction, Chemistry, College Faculty, Teaching Assistants
Park, Yeonggwang; Cádiz, Manuel Díaz; Nagle, Kathleen F.; Stepp, Cara E. – Journal of Speech, Language, and Hearing Research, 2020
Purpose: Assessment of strained voice quality is difficult due to the weak reliability of auditory-perceptual evaluation and lack of strong acoustic correlates. This study evaluated the contributions of relative fundamental frequency (RFF) and mid-to-high frequency noise to the perception of strain. Method: Stimuli were created using recordings of…
Descriptors: Acoustics, Audio Equipment, Auditory Perception, Correlation
Kinnear, George; Bennett, Max; Binnie, Rachel; Bolt, Róisín; Zheng, Yinglan – Teaching Mathematics and Its Applications, 2020
The MATH taxonomy classifies questions according to the mathematical skills required to answer them. It was created to aid the development of more balanced assessments in undergraduate mathematics and has since been used to compare different assessment regimes across school and university. To date, there has been no systematic investigation of the…
Descriptors: Taxonomy, Mathematics Instruction, Teaching Methods, Reliability
Atilgan, Hakan – Eurasian Journal of Educational Research, 2019
Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…
Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Eryilmaz, Önder – Participatory Educational Research, 2022
Although there is an increasing number of studies concentrating upon education, some researchers have revealed that most studies, including qualitative studies in education, have methodological issues. One of the most common mistakes and neglected issues in qualitative studies is not to ensure the trustworthiness of the research, which indeed is…
Descriptors: Foreign Countries, Doctoral Dissertations, Research Methodology, Credibility
Verhelst, Dries; Vanhoof, Jan; Van Petegem, Peter – Environmental Education Research, 2022
Empirically based tools to map education for sustainable development within school organisations are not readily available, which is both a cause and a consequence of the scarce empirical and quantitative research on school organisations and education for sustainable development. In present study, the Education for Sustainable Development School…
Descriptors: Test Construction, Test Validity, Sustainable Development, School Organization
Kinarsky, Alana R.; Christie, Christina A. – American Journal of Evaluation, 2022
Since 2007, two taxonomies have been proposed to identify the components of evaluation practice that may be specified in an evaluation policy. Little is known, however, about how these taxonomies align with evaluation policies developed by philanthropic foundations. Through thematic analysis, this article first compares 12 foundation evaluation…
Descriptors: Taxonomy, Evaluation Methods, Philanthropic Foundations, Educational Policy
Ozalp, Ugur; Cetin, Munevver – International Journal of Assessment Tools in Education, 2022
The aim of this study was to develop a scale instrument for measuring academic intellectual capital in the Turkish higher education context depending on student perceptions. The sample consisted of students of higher education institutions in the 2020-2021 academic year. Data were gathered in two stages. Exploratory Factor Analysis (EFA) was…
Descriptors: Measures (Individuals), College Students, Test Validity, Test Reliability
Akdeniz, Seher; Budak, Hatice; Ahçi, Zeynep G. – International Education Studies, 2022
Narcissism in social media reveals itself differently than in daily social interactions. Therefore, the present study aimed to develop a Scale of Narcissism in Social Media through the lens of the Narcissistic Admiration and Rivalry Model and to investigate its psychometric characteristics. The total sample of the study consisted of 740…
Descriptors: Test Construction, Personality Traits, Social Media, Psychometrics
Dambha, Tasneem; Swanepoel, De Wet; Mahomed-Asmail, Faheema; De Sousa, Karina C.; Graham, Marien A.; Smits, Cas – Journal of Speech, Language, and Hearing Research, 2022
Purpose: This study compared the test characteristics, test-retest reliability, and test efficiency of three novel digits-in-noise (DIN) test procedures to a conventional antiphasic 23-trial adaptive DIN (D23). Method: One hundred twenty participants with an average age of 42 years (SD = 19) were included. Participants were tested and retested…
Descriptors: Auditory Tests, Screening Tests, Efficiency, Test Format
Gil-Llario, María Dolores; Flores-Buils, Raquel; Elipe-Miravet, Marcel; Fernández-García, Olga; Ballester-Arnal, Rafael – Journal of Applied Research in Intellectual Disabilities, 2022
Background: This paper presents a description of the development and psychometric properties of a self-report instrument for the assessment of sexual behaviour and concerns of people with mild intellectual disabilities (SEBECOMID-S). Methods and procedures: The study included 281 people with mild intellectual disabilities. The psychometric…
Descriptors: Test Construction, Psychometrics, Measurement Techniques, Sexuality
Levin, Nathan; Baker, Ryan S.; Nasiar, Nidhi; Fancsali, Stephen; Hutt, Stephen – International Educational Data Mining Society, 2022
Research into "gaming the system" behavior in intelligent tutoring systems (ITS) has been around for almost two decades, and detection has been developed for many ITSs. Machine learning models can detect this behavior in both real-time and in historical data. However, intelligent tutoring system designs often change over time, in terms…
Descriptors: Intelligent Tutoring Systems, Artificial Intelligence, Models, Cheating
Almohalha, Lucieny; Santos, Jair Lício Ferreira; Pfeifer, Luzia Iara – Journal of Occupational Therapy, Schools & Early Intervention, 2022
The purpose of this research was to organize a cross-cultural adaptation study and analyze the reproducibility and test-retest reliability of the Infant Sensory Profile 2 (ISP2Br) to Brazilian babies. It was hypothesized that the instrument would be validated for use with Brazilian babies. The English language version of the profile was translated…
Descriptors: Foreign Countries, Infants, Sensory Experience, Portuguese

Peer reviewed
Direct link
