Publication Date
| In 2026 | 0 |
| Since 2025 | 621 |
| Since 2022 (last 5 years) | 3121 |
| Since 2017 (last 10 years) | 7362 |
| Since 2007 (last 20 years) | 15000 |
Descriptor
| Test Reliability | 15006 |
| Test Validity | 10245 |
| Reliability | 9748 |
| Foreign Countries | 7119 |
| Test Construction | 4807 |
| Validity | 4189 |
| Measures (Individuals) | 3872 |
| Factor Analysis | 3820 |
| Psychometrics | 3513 |
| Interrater Reliability | 3117 |
| Correlation | 3037 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1319 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 249 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Anna Cecilia McWhirter; Katherine A. Hails; David S. DeGarmo; Laura Lee McIntyre; S. Andrew Garbacz; Elizabeth A. Stormshak – Grantee Submission, 2024
Reliable and valid assessment of parenting and child behaviors is critical for clinicians and researchers alike, and observational measures of parenting behaviors are often considered the gold standard for assessing parenting and parent-child interaction quality. The current study sought to evaluate the reliability and validity of the Coder…
Descriptors: Questionnaires, Test Reliability, Test Validity, Kindergarten
Benjamin Lee Wolden – ProQuest LLC, 2024
Clinical Reasoning (CR) integrates thinking and decision-making in clinical practice (Huhn et al., 2019). CR is an established area of research in Doctor of Physical Therapy (DPT) education (Musolino & Jensen, 2019; Jensen & Mostrom, 2012) and has been acknowledged as a core competency of physical therapist residency education (APTA…
Descriptors: Thinking Skills, Skill Development, Simulation, Learning Experience
Sarah French; Ashton Dickerson; Raoul A. Mulder – Higher Education: The International Journal of Higher Education Research, 2024
High-stakes examinations enjoy widespread use as summative assessments in higher education. We review the arguments for and against their use, across seven common themes: memory recall and knowledge retention; student motivation and learning; authenticity and real-world relevance; validity and reliability; academic misconduct and contract…
Descriptors: High Stakes Tests, Program Effectiveness, Evidence Based Practice, Summative Evaluation
On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024
Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
ACT Education Corp., 2024
This technical manual provides an overview of the Mosaic™ by ACT®: Social Emotional Learning Screener. The Mosaic by ACT: Social Emotional Learning Screener (hereafter referred to as the Screener) assesses the social emotional skills of students in elementary school (Grades 3-5), middle school (Grades 6-8), and high school (Grades 9-12). Each…
Descriptors: Social Emotional Learning, Screening Tests, Elementary School Students, Middle School Students
Carazo-Vargas, Pedro; Salazar-Obando, Joshua; Vargas-Montero, Andrea; Alvarado-Barrantes, Ricardo; Siles-Canales, Francisco; Moncada-Jiménez, José – Measurement in Physical Education and Exercise Science, 2020
The aim of the study was to determine the convergent validity of a portable polysomnograph and an accelerometer for measuring sleep efficiency and movement in college students. Volunteers were 29 healthy students (males = 15, females = 14) who simultaneously wore the Nox T3 portable polysomnograph and the ActiGraph wGT3X-BT accelerometer. Both…
Descriptors: Sleep, Efficiency, Motion, Measurement Equipment
Buchan, Duncan S.; Boddy, Lynne M.; McLellan, Gillian – Measurement in Physical Education and Exercise Science, 2020
This study evaluated agreement in activity outcomes from ActiGraph accelerometers worn on both wrists in a laboratory and free-living setting. Part 1: Thirty-seven participants (25.5 ± 10.5 years) completed laboratory activities. Part 2: Thirty-nine participants (28.5 ± 9.8 years) wore accelerometers for 7 days. Outcomes included average…
Descriptors: Physical Activities, Measurement Equipment, Handedness, Physics
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Maio, Shannon; Dumas, Denis; Organisciak, Peter; Runco, Mark – Creativity Research Journal, 2020
In recognition of the capability of text-mining models to quantify aspects of language use, some creativity researchers have adopted text-mining models as a mechanism to objectively and efficiently score the Originality of open-ended responses to verbal divergent thinking tasks. With the increasing use of text-mining models in divergent thinking…
Descriptors: Creative Thinking, Scores, Reliability, Data Analysis
Cramer, Kenneth; DeBlock, Denise – Collected Essays on Learning and Teaching, 2020
Following 20 years of publishing rank and reputation scores for Canada's 49 institutions of higher education, the present analysis tested five hypotheses: (1) rank and reputation should be positively correlated across schools for each year; (2) rank and reputation should be positively correlated across the 20 years for each school; (3) a school's…
Descriptors: Periodicals, Universities, Reputation, Educational History
Derouchey, Joe D.; Tomkinson, Grant R.; Rhoades, Jesse L.; Fitzgerald, John S. – Measurement in Physical Education and Exercise Science, 2020
Anthropometry is important for predicting sports performance. While 3-dimensional (3D) body scanners increase the feasibility of anthropometric assessment, reliability data on athletes are lacking. The aim of this study was to determine the test-retest reliability of a portable, single-camera 3D body scanning system (Styku S100) to assess…
Descriptors: Measurement Equipment, Human Body, Body Composition, Reliability
Borbély-Pecze, Tibor Bors – British Journal of Guidance & Counselling, 2020
An overview of the evolution of career information in light of the changing nature of the world of work is presented. Owing to the constant fundamental changes in the labour market, the distribution of paid work has been also constantly changing. In this article, a more dynamic and -- often temporary -- interplay between citizens and their…
Descriptors: Career Development, Information Sources, Validity, Reliability
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Ingham, Barry; Bentley, Alice; Rhodes, Jenny; Dagnan, Dave – Journal of Applied Research in Intellectual Disabilities, 2020
Background: This article describes the development and use of the Formulation Understanding Measure to evaluate team formulation with staff supporting people with intellectual disabilities. Method: A quantitative design with an opportunistic sample was used to evaluate the psychometric properties of the Formulation Understanding Measure (FUM)…
Descriptors: Intellectual Disability, Psychometrics, Test Construction, Teamwork
Langfeldt, Liv; Nedeva, Maria; Sörlin, Sverker; Thomas, Duncan A. – Minerva: A Review of Science, Learning and Policy, 2020
Notions of research quality are contextual in many respects: they vary between fields of research, between review contexts and between policy contexts. Yet, the role of these co-existing notions in research, and in research policy, is poorly understood. In this paper we offer a novel framework to study and understand research quality across three…
Descriptors: Research Methodology, Educational Quality, Policy, Novelty (Stimulus Dimension)

Peer reviewed
Direct link
