Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Dillon, Emily; Holingue, Calliope; Herman, Dana; Landa, Rebecca J. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Social communication or pragmatic skills are continuously distributed in the general population. Impairment in these skills is associated with two clinical disorders, autism spectrum disorder (ASD) and social (pragmatic) communication disorder. Such impairment can impact a child's peer acceptance, school performance, and current and later…
Descriptors: Psychometrics, Pragmatics, Rating Scales, Elementary School Students
Kaharu, Sarintan N.; Mansyur, Jusman – Pegem Journal of Education and Instruction, 2021
This study aims to develop a test that can be used to explore mental models and representation patterns of objects in liquid fluid. The test developed by adapting the Reeves's Development Model was carried out in several stages, namely: determining the orientation and test segments; initial survey; preparation of the initial draft; try out;…
Descriptors: Test Construction, Schemata (Cognition), Scientific Concepts, Water
Rebernik, Teja; Jacobi, Jidde; Tiede, Mark; Wieling, Martijn – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study compares two electromagnetic articulographs manufactured by Northern Digital, Inc.: the NDI Wave System (from 2008) and the NDI Vox-EMA System (from 2020). Method: Four experiments were completed: (1) comparison of statically positioned sensors; (2) tracking dynamic movements of sensors manipulated using a motor-driven LEGO…
Descriptors: Measurement Equipment, Articulation (Speech), Accuracy, Reliability
Seeber, Marco; Vlegels, Jef; Reimink, Elwin; Marusic, Ana; Pina, David G. – Research Evaluation, 2021
We have limited understanding of why reviewers tend to strongly disagree when scoring the same research proposal. Thus far, research that explored disagreement has focused on the characteristics of the proposal or the applicants, while ignoring the characteristics of the reviewers themselves. This article aims to address this gap by exploring…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Research Proposals
Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021
Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…
Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing
Kazimi, Parviz Firudin Oqlu – Journal of Practical Studies in Education, 2021
The reliability of information in the global information space is one of the most important problems of globalization. The credibility of various information resources is currently being studied and considered in different ways. In some cases, the problem of the reliability of information can be assessed as harmful and dangerous. This article,…
Descriptors: Information Sources, Reliability, Credibility, Classification
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods
McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Prevention Science, 2022
Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…
Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Gersten, Russell; Jayanthi, Madhavi; Newman-Gonchar, Rebecca; Anderson, Daniel; Spallone, Samantha; Taylor, Mary Jo – Regional Educational Laboratory Southeast, 2020
Several school districts in Georgia use two teacher-administered diagnostic assessments of student knowledge of mathematics as part of their multi-tiered system of support in grades K-8: the Global Strategy Stage (GloSS; New Zealand Ministry of Education, 2012) and the Individual Knowledge Assessment of Number (IKAN; New Zealand Ministry of…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Regional Educational Laboratory Southeast, 2020
This document are the appendixes for the report, "The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments." Rather than relying on occasional testimonials from the field, decisions about using diagnostic assessments across the state should be based on psychometric data from an…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Regional Educational Laboratory Southeast, 2020
Teachers need to assess their students' current level of mathematical understanding to provide appropriate interventions for students who are struggling. Several school districts in Georgia currently use two assessments for this purpose--the Global Strategy Stage (GloSS) and the Individual Knowledge Assessment of Number (IKAN). The IKAN is…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023
The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…
Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies
Halvorsen, Marianne Berg; Helverschou, Sissel Berge; Axelsdottir, Brynhildur; Brøndbo, Per Håkan; Martinussen, Monica – Journal of Autism and Developmental Disorders, 2023
There is a need for more knowledge of valid and standardized measures of mental health problems among children and adolescents with intellectual disability (ID). In this study, we systematically reviewed and evaluated the psychometric properties of instruments used to assess general mental health problems in this population. Following PRISMA…
Descriptors: Measures (Individuals), Clinical Diagnosis, Mental Health, Mental Disorders
Stark, Kristabel; Bettini, Elizabeth; Cumming, Michelle; O'Brien, Kristen Merrill; Brunsting, Nelson; Huggins-Manley, Corinne; Binkert, Gino; Shaheen, Tashnuva – Remedial and Special Education, 2023
Special education teachers' (SETs) working conditions play a crucial role in shaping the size, quality, and effectiveness of the U.S. SET workforce and thereby shape the quality of instruction provided to students with disabilities. Valid measures of SETs' working conditions are essential for conducting robust research on how to improve working…
Descriptors: Special Education, Teaching Conditions, Special Education Teachers, Students with Disabilities

Peer reviewed
Direct link
