Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Raykov, Tenko; Marcoulides, George A.; Harrison, Michael; Menold, Natalja – Educational and Psychological Measurement, 2019
This note confronts the common use of a single coefficient alpha as an index informing about reliability of a multicomponent measurement instrument in a heterogeneous population. Two or more alpha coefficients could instead be meaningfully associated with a given instrument in finite mixture settings, and this may be increasingly more likely the…
Descriptors: Statistical Analysis, Test Reliability, Measures (Individuals), Computation
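For readers unfamiliar with the coefficient discussed in the record above, the standard formula for coefficient alpha is k/(k-1) multiplied by (1 minus the sum of the item variances divided by the variance of the total score). The following is a minimal sketch of that textbook formula only; it is not the finite mixture procedure of the cited note, and the function name `coefficient_alpha` and the sample data are hypothetical.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Coefficient (Cronbach's) alpha from raw item scores.

    `scores` is a list of rows: one row per respondent, one column per item.
    This layout is an assumption made for illustration.
    """
    k = len(scores[0])  # number of items (components)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the total (sum) score
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 4 respondents, 3 items
sample = [
    [3, 4, 3],
    [2, 2, 3],
    [5, 4, 4],
    [1, 2, 2],
]
print(round(coefficient_alpha(sample), 3))
```

Running the same function separately on each subgroup of a sample is one simple way to see that a single instrument can yield different alpha values in different subpopulations, although that shortcut assumes group membership is known, which a finite mixture setting does not.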
Williams, Logan; Kemp, Simon – Assessment & Evaluation in Higher Education, 2019
We examined the reliability of grading master's theses at a New Zealand university, where a variant of the academic journal review system is employed. The overall correlation between the grades recommended by internal and external markers of master's theses in psychology and applied psychology at this university was 0.39, which is similar to that…
Descriptors: Interrater Reliability, Masters Theses, Foreign Countries, Grades (Scholastic)
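The agreement figure in the record above is a correlation between two markers' grades. The abstract does not say which coefficient was used; the sketch below assumes a Pearson correlation, and the function name `pearson_r` and the grade lists are hypothetical illustrations rather than data from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numeric grades."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares around the means
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical grades (percent) from an internal and an external marker
internal = [72, 85, 64, 90, 78]
external = [70, 80, 75, 88, 69]
print(round(pearson_r(internal, external), 2))
```

A correlation captures only whether the two markers rank theses similarly; it does not show whether one marker is systematically harsher than the other.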
Bliss, Alex; Dekerle, Jeanne – Measurement in Physical Education and Exercise Science, 2019
Knee flexor and extensor muscular assessment via isokinetic dynamometry is common practice and established in the research literature. However, reporting assessment methodology regarding reciprocal and nonreciprocal movements is often vague or absent. Such methodological issues are crucial for accurate assessments. Therefore, knee extensor and…
Descriptors: Motor Reactions, Muscular Strength, Males, Test Reliability
Bramley, Tom; Vitello, Sylvia – Assessment in Education: Principles, Policy & Practice, 2019
Comparative Judgement (CJ) is an increasingly widely investigated method in assessment for creating a scale, for example of the quality of essays. One area that has attracted attention in CJ studies is the optimisation of the selection of pairs of objects for judgement. One approach is known as adaptive comparative judgement (ACJ). It has been…
Descriptors: Reliability, Evaluation Methods, Comparative Analysis, Essay Tests
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019
This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…
Descriptors: Test Validity, Test Reliability, Test Items, Correlation
Sunahase, Takeru; Baba, Yukino; Kashima, Hisashi – International Educational Data Mining Society, 2019
Peer assessment is a promising solution for scaling up the grading of a large number of submissions. The reliability of evaluations is one of the critical issues in peer assessment; several probabilistic models have been proposed for obtaining reliable grades from peers. Peer correction is a similar framework, in which students are instructed to…
Descriptors: Peer Evaluation, Error Correction, Grading, Reliability
Abdalla, Widad – ProQuest LLC, 2019
Trend scoring is often used in large-scale assessments to monitor for rater drift when the same constructed response items are administered in multiple test administrations. In trend scoring, a set of responses from Time "A" are rescored by raters at Time "B." The purpose of this study is to examine the ability of…
Descriptors: Scoring, Interrater Reliability, Test Items, Error Patterns
Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019
Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.
Descriptors: Scores, Measurement, Test Reliability, Error Patterns
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Sari Pramila-Savukoski; Heli-Maria Kuivila; Jonna Juntunen; Miro Koskenranta; Erika Jarva; Anna-Maria Tuomikoski; Mira Hammarén; Kristina Mikkonen – International Journal of Research in Education and Science, 2024
There is a clear need for highly competent health sciences experts. No instrument currently exists for assessing the generic competences of health sciences students. The aim of this study is to develop and psychometrically test the Health sciences Generic Competence (HealthGenericCom) instrument. The instrument development four step process has…
Descriptors: Test Construction, Psychometrics, Health Sciences, Competence
Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024
Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…
Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis
Marisa G. Filipe; Cátia Severino; Marina Vigário; Sónia Frota – International Journal of Language & Communication Disorders, 2024
Background: As delays or disorders in early language and communication are the most prevalent symptom in children with disabilities, early screening is crucial to promote prevention, early diagnosis, and intervention. However, to the best of our knowledge, no screening tool is available for the joint assessment of early language and social…
Descriptors: Portuguese, Infants, Toddlers, Check Lists
Carol Reeves; J. J. Sylvia IV – Journal of Technical Writing and Communication, 2024
Since its release in late 2022, ChatGPT and subsequent generative artificial intelligence (GAI) tools have raised a wide variety of questions and concerns for the field of technical communication: How will these tools be incorporated into professional settings? How might we appropriately integrate these tools into our research and teaching? In…
Descriptors: Computer Simulation, Computer Uses in Education, Writing Instruction, Prompting
Zofia Mazur-Socha; Mariola Laguna; Peter Gollwitzer – Music Education Research, 2024
This article reports on the development and validation of the Instrumental Practice Goal Realization Inventory (IPGRI) designed to assess the process of self-directed study, beginning with setting the intention to practice and ending with the evaluation of one's performance. This new tool is based on the theoretical model of action phases. The…
Descriptors: Music Education, Music Activities, Measures (Individuals), Test Construction
Samantha Ridout; Sigmund Eldevik – Review Journal of Autism and Developmental Disorders, 2024
This review is aimed at identifying assessment instruments used to measure treatment outcomes in children with autism spectrum disorder who received early and intensive behavioral interventions. Forty three articles were included and appraised using the Council for Exceptional Children's Standards for Evidence Based Practice quality index rater.…
Descriptors: Outcomes of Treatment, Autism Spectrum Disorders, Psychometrics, Behavior Problems

