Publication Date
| In 2026 | 3 |
| Since 2025 | 636 |
| Since 2022 (last 5 years) | 3137 |
| Since 2017 (last 10 years) | 7378 |
| Since 2007 (last 20 years) | 15016 |
Descriptor
| Test Reliability | 15015 |
| Test Validity | 10252 |
| Reliability | 9751 |
| Foreign Countries | 7126 |
| Test Construction | 4811 |
| Validity | 4189 |
| Measures (Individuals) | 3875 |
| Factor Analysis | 3821 |
| Psychometrics | 3515 |
| Interrater Reliability | 3122 |
| Correlation | 3037 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1320 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
On-Soon Lee – Journal of Pan-Pacific Association of Applied Linguistics, 2024
Despite the increasing interest in using AI tools as assistant agents in instructional settings, the effectiveness of ChatGPT, the generative pretrained AI, for evaluating the accuracy of second language (L2) writing has been largely unexplored in formative assessment. Therefore, the current study aims to examine how ChatGPT, as an evaluator,…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
ACT Education Corp., 2024
This technical manual provides an overview of the Mosaic™ by ACT®: Social Emotional Learning Screener. The Mosaic by ACT: Social Emotional Learning Screener (hereafter referred to as the Screener) assesses the social emotional skills of students in elementary school (Grades 3-5), middle school (Grades 6-8), and high school (Grades 9-12). Each…
Descriptors: Social Emotional Learning, Screening Tests, Elementary School Students, Middle School Students
Carazo-Vargas, Pedro; Salazar-Obando, Joshua; Vargas-Montero, Andrea; Alvarado-Barrantes, Ricardo; Siles-Canales, Francisco; Moncada-Jiménez, José – Measurement in Physical Education and Exercise Science, 2020
The aim of the study was to determine the convergent validity of a portable polysomnograph and an accelerometer for measuring sleep efficiency and movement in college students. Volunteers were 29 healthy students (males = 15, females = 14) who simultaneously wore the Nox T3 portable polysomnograph and the ActiGraph wGT3X-BT accelerometer. Both…
Descriptors: Sleep, Efficiency, Motion, Measurement Equipment
Buchan, Duncan S.; Boddy, Lynne M.; McLellan, Gillian – Measurement in Physical Education and Exercise Science, 2020
This study evaluated agreement in activity outcomes from ActiGraph accelerometers worn on both wrists in a laboratory and free-living setting. Part 1: Thirty-seven participants (25.5 ± 10.5 years) completed laboratory activities. Part 2: Thirty-nine participants (28.5 ± 9.8 years) wore accelerometers for 7 days. Outcomes included average…
Descriptors: Physical Activities, Measurement Equipment, Handedness, Physics
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Maio, Shannon; Dumas, Denis; Organisciak, Peter; Runco, Mark – Creativity Research Journal, 2020
In recognition of the capability of text-mining models to quantify aspects of language use, some creativity researchers have adopted text-mining models as a mechanism to objectively and efficiently score the Originality of open-ended responses to verbal divergent thinking tasks. With the increasing use of text-mining models in divergent thinking…
Descriptors: Creative Thinking, Scores, Reliability, Data Analysis
Cramer, Kenneth; DeBlock, Denise – Collected Essays on Learning and Teaching, 2020
Following 20 years of publishing rank and reputation scores for Canada's 49 institutions of higher education, the present analysis tested five hypotheses: (1) rank and reputation should be positively correlated across schools for each year; (2) rank and reputation should be positively correlated across the 20 years for each school; (3) a school's…
Descriptors: Periodicals, Universities, Reputation, Educational History
Derouchey, Joe D.; Tomkinson, Grant R.; Rhoades, Jesse L.; Fitzgerald, John S. – Measurement in Physical Education and Exercise Science, 2020
Anthropometry is important for predicting sports performance. While 3-dimensional (3D) body scanners increase the feasibility of anthropometric assessment, reliability data on athletes are lacking. The aim of this study was to determine the test-retest reliability of a portable, single-camera 3D body scanning system (Styku S100) to assess…
Descriptors: Measurement Equipment, Human Body, Body Composition, Reliability
Borbély-Pecze, Tibor Bors – British Journal of Guidance & Counselling, 2020
An overview of the evolution of career information in light of the changing nature of the world of work is presented. Owing to the constant fundamental changes in the labour market, the distribution of paid work has been also constantly changing. In this article, a more dynamic and -- often temporary -- interplay between citizens and their…
Descriptors: Career Development, Information Sources, Validity, Reliability
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Ingham, Barry; Bentley, Alice; Rhodes, Jenny; Dagnan, Dave – Journal of Applied Research in Intellectual Disabilities, 2020
Background: This article describes the development and use of the Formulation Understanding Measure to evaluate team formulation with staff supporting people with intellectual disabilities. Method: A quantitative design with an opportunistic sample was used to evaluate the psychometric properties of the Formulation Understanding Measure (FUM)…
Descriptors: Intellectual Disability, Psychometrics, Test Construction, Teamwork
Langfeldt, Liv; Nedeva, Maria; Sörlin, Sverker; Thomas, Duncan A. – Minerva: A Review of Science, Learning and Policy, 2020
Notions of research quality are contextual in many respects: they vary between fields of research, between review contexts and between policy contexts. Yet, the role of these co-existing notions in research, and in research policy, is poorly understood. In this paper we offer a novel framework to study and understand research quality across three…
Descriptors: Research Methodology, Educational Quality, Policy, Novelty (Stimulus Dimension)
Hicks, Nathan M. – ProQuest LLC, 2020
Grades serve as one of the primary indicators of student learning, directing subsequent actions for students, instructors, and administrators, alike. Therefore, grade validity--that is, the extent to which grades communicate a meaningful and credible representation of what they purport to measure--is of utmost importance. However, a grade cannot…
Descriptors: Grading, Scoring Rubrics, Interrater Reliability, Test Validity
Marianna Papadopoulou; Sophia Stasi; Daphne Bakalidou; Effie Papageorgiou; Aristi Tsokani; Theodora Bratsi; George Papathanasiou – Journal of Developmental and Physical Disabilities, 2020
To explore the psychometric properties of the Greek version of the World Health Organization Disability Assessment Schedule (WHODAS 2.0-12 item) in adult patients suffering from motor disabilities. The questionnaire of WHODAS 2.0-12 item was officially translated and cross-culturally adapted into Greek (WHODAS 2.0-12Gr). 136 adult patients with…
Descriptors: Adults, Patients, Disabilities, Evaluation
Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…
Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability

