Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Ramon-Casas, Marta; Nuño, Neus; Pons, Ferran; Cunillera, Toni – Assessment & Evaluation in Higher Education, 2019
This article presents an empirical evaluation of the validity and reliability of a peer-assessment activity to improve academic writing competences. Specifically, we explored a large group of psychology undergraduate students with different initial writing skills. Participants (n = 365) produced two different essays, which were evaluated by their…
Descriptors: Peer Evaluation, Validity, Reliability, Writing Skills
Brown, Deirdre A.; Lamb, Michael E. – Applied Cognitive Psychology, 2019
In this brief review, we reflect upon the key contributions of research examining children's eyewitness testimony. Children's testimonial ability became a focus of interest for researchers about 40 years ago in the wake of several high-profile child abuse cases that prompted questions about children's reliability in the face of problematic…
Descriptors: Child Abuse, Reliability, Accuracy, Children
Donegan, Sarah; Dias, Sofia; Welton, Nicky J. – Research Synthesis Methods, 2019
When numerous treatments exist for a disease (Treatments 1, 2, 3, etc), network meta-regression (NMR) examines whether each relative treatment effect (eg, mean difference for 2 vs 1, 3 vs 1, and 3 vs 2) differs according to a covariate (eg, disease severity). Two consistency assumptions underlie NMR: consistency of the treatment effects at the…
Descriptors: Reliability, Regression (Statistics), Outcomes of Treatment, Statistical Analysis
Ma, Timmy; Komarova, Natalia L. – Cognitive Science, 2019
Learning in natural environments is often characterized by a degree of inconsistency from an input. These inconsistencies occur, for example, when learning from more than one source, or when the presence of environmental noise distorts incoming information; as a result, the task faced by the learner becomes ambiguous. In this study, we investigate…
Descriptors: Reliability, Associative Learning, Symbolic Learning, Sequential Learning
Jannetts, Stephen; Schaeffler, Felix; Beck, Janet; Cowen, Steve – International Journal of Language & Communication Disorders, 2019
Background: Occupational voice problems constitute a serious public health issue with substantial financial and human consequences for society. Modern mobile technologies such as smartphones have the potential to enhance approaches to prevention and management of voice problems. This paper addresses an important aspect of smartphone-assisted voice…
Descriptors: Voice Disorders, Handheld Devices, Acoustics, Assistive Technology
Kelley, Kairn Stetler; Littenberg, Benjamin – Journal of Speech, Language, and Hearing Research, 2019
Method: Sixty English-speaking children, 7-14 years old with normal hearing, had a single study visit during which each test was administered twice. Changes on retest were summarized by within-subject standard deviation (S_w), compared among tests, and compared with binomial model predictions. Correlates of variance were explored.…
Descriptors: Children, Early Adolescents, Listening Skills, Test Reliability
Raykov, Tenko; Marcoulides, George A.; Harrison, Michael; Menold, Natalja – Educational and Psychological Measurement, 2019
This note confronts the common use of a single coefficient alpha as an index informing about reliability of a multicomponent measurement instrument in a heterogeneous population. Two or more alpha coefficients could instead be meaningfully associated with a given instrument in finite mixture settings, and this may be increasingly more likely the…
Descriptors: Statistical Analysis, Test Reliability, Measures (Individuals), Computation
Williams, Logan; Kemp, Simon – Assessment & Evaluation in Higher Education, 2019
We examined the reliability of grading master's theses at a New Zealand university, where a variant of the academic journal review system is employed. The overall correlation between the grades recommended by internal and external markers of master's theses in psychology and applied psychology at this university was 0.39, which is similar to that…
Descriptors: Interrater Reliability, Masters Theses, Foreign Countries, Grades (Scholastic)
Bliss, Alex; Dekerle, Jeanne – Measurement in Physical Education and Exercise Science, 2019
Knee flexor and extensor muscular assessment via isokinetic dynamometry is common practice and established in the research literature. However, reporting assessment methodology regarding reciprocal and nonreciprocal movements is often vague or absent. Such methodological issues are crucial for accurate assessments. Therefore, knee extensor and…
Descriptors: Motor Reactions, Muscular Strength, Males, Test Reliability
Bramley, Tom; Vitello, Sylvia – Assessment in Education: Principles, Policy & Practice, 2019
Comparative Judgement (CJ) is an increasingly widely investigated method in assessment for creating a scale, for example of the quality of essays. One area that has attracted attention in CJ studies is the optimisation of the selection of pairs of objects for judgement. One approach is known as adaptive comparative judgement (ACJ). It has been…
Descriptors: Reliability, Evaluation Methods, Comparative Analysis, Essay Tests
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019
This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…
Descriptors: Test Validity, Test Reliability, Test Items, Correlation
Sunahase, Takeru; Baba, Yukino; Kashima, Hisashi – International Educational Data Mining Society, 2019
Peer assessment is a promising solution for scaling up the grading of a large number of submissions. The reliability of evaluations is one of the critical issues in peer assessment; several probabilistic models have been proposed for obtaining reliable grades from peers. Peer correction is a similar framework, in which students are instructed to…
Descriptors: Peer Evaluation, Error Correction, Grading, Reliability
Abdalla, Widad – ProQuest LLC, 2019
Trend scoring is often used in large-scale assessments to monitor for rater drift when the same constructed response items are administered in multiple test administrations. In trend scoring, a set of responses from Time "A" are rescored by raters at Time "B." The purpose of this study is to examine the ability of…
Descriptors: Scoring, Interrater Reliability, Test Items, Error Patterns
Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019
Reliability is the consistency of a set of scores that are designed to measure the same thing. Reliability is a statistical property of scores that must be demonstrated rather than assumed.
Descriptors: Scores, Measurement, Test Reliability, Error Patterns
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs

Peer reviewed
