Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Bayraktar, Aysegul; Yalcin, Seher – International Journal of Assessment Tools in Education, 2021
In this study, the aim was to both develop a valid and reliable measurement tool for determining teachers' attitudes as well as to determine their opinions towards design and skill workshops (DSW). In addition, the researchers aimed to determine how teachers rank design and skill workshops based on their importance. Since an attempt was made to…
Descriptors: Teacher Attitudes, Elementary School Teachers, Secondary School Teachers, Value Judgment
Bello, Hassan; Abdullah, Nor Athiyah – Electronic Journal of e-Learning, 2021
Computer-based assessment or e-assessment system is an e-learning system where information communication technology is utilized for examination activity, grading, and recording of responses of the examinees. It includes the entire assessment process from the examinees, teachers, institutions, examination agencies, and the public. E-assessment…
Descriptors: Evaluation Methods, Computer Assisted Testing, Technology Uses in Education, Program Effectiveness
Scott, Kristin C.; Nimon, Kim – Journal of Research on Technology in Education, 2021
Mishra and Koehler's technological pedagogical content knowledge (TPACK) theory can help provide the framework for measuring needed knowledge, skills, and abilities (KSAs) in faculty members at 2-year public colleges. This study tests a self-assessment survey in a large sample of 2-year public college faculty members. Using factor analysis, the…
Descriptors: Construct Validity, Two Year Colleges, College Faculty, Teacher Attitudes
Sudina, Ekaterina; Brown, Jason; Datzman, Brien; Oki, Yukiko; Song, Katherine; Cavanaugh, Robert; Thiruchelvam, Bala; Plonsky, Luke – Innovation in Language Learning and Teaching, 2021
'Grit' has been identified as an important predictor of success in a number of academic and non-academic domains (Duckworth, A. L., C. Peterson, M. D. Matthews, and D. R. Kelly. 2007. "Grit: Perseverance and Passion for Long-Term Goals." "Journal of Personality and Social Psychology" 92: 1087-1101.…
Descriptors: Measures (Individuals), Factor Analysis, Second Language Learning, Second Language Instruction
Kim, Peter – Language Teaching Research Quarterly, 2021
Foreign language aptitude is defined as one's potential to learn a second language. A language learner with higher aptitude is predicted to learn more, faster, and reach a higher level of proficiency. If this is the case, one way to validate the construct of aptitude and its measure is to conduct a validation study in which measures of aptitude is…
Descriptors: Morphology (Languages), Syntax, Second Language Learning, Second Language Instruction
Emma Armstrong-Carter; Kathy T. Do; Joao F. Guassi Moreira; Mitchell J. Prinstein; Eva H. Telzer – Grantee Submission, 2021
Introduction: This longitudinal study designed and tested the validity of a new measure of pro-social risk taking -- risks that individuals take in order to help others. Methods: The sample was racially and ethnically diverse adolescents in the rural Southeastern United States (N = 867; Mage = 12.82 years, 10-14 years at Wave 1; 50% Girls, 33%…
Descriptors: Race, Ethnicity, Goodness of Fit, Factor Analysis
Khoshdel, Fahimeh – International Journal of Language Testing, 2017
In the current study, the validity of C-Test is investigated using the construct identification approach. Based on construct identification approach, the factors which are deemed to affect item difficulty in C-Test items were identified. To this aim, 11 factors were selected to enter into Linear Logistic Testing Model (LLTM) analysis to…
Descriptors: Cloze Procedure, Language Tests, Test Items, Difficulty Level
Cetin, Saban – Journal of Education and Practice, 2017
This study aims to develop a measurement tool having measurement reliability with the aim of determining attitudes for values acquisition of secondary school students. The study was conducted on totally 325 high school senior students as 200 female and 125 male students in spring semester of 2014-2015 educational year. In the study, expert opinion…
Descriptors: Attitude Measures, Test Validity, Test Reliability, Values Education
Karren, Benjamin C. – Journal of Psychoeducational Assessment, 2017
The Gilliam Autism Rating Scale-Third Edition (GARS-3) is a norm-referenced tool designed to screen for autism spectrum disorders (ASD) in individuals between the ages of 3 and 22 (Gilliam, 2014). The GARS-3 test kit consists of three different components and includes an "Examiner's Manual," summary/response forms (50), and the…
Descriptors: Autism, Pervasive Developmental Disorders, Rating Scales, Norm Referenced Tests
Bull, Rebecca; Yao, Shih-Ying; Ng, Ee Lynn – International Journal of Early Childhood, 2017
The early childhood sector in Singapore has witnessed vast changes in the past two decades. One of the key policy aims is to improve classroom quality. To ensure a rigorous evaluation of the quality of early childhood environments in Singapore, it is important to determine whether commonly used assessments of quality are valid indicators across…
Descriptors: Foreign Countries, Rating Scales, Educational Environment, Educational Quality
Drengenberg, Nicholas; Bain, Alan – Higher Education Research and Development, 2017
This paper addresses the wicked problem of measuring the productivity of learning and teaching in higher education. We show how fundamental validity issues and difficulties identified in educational productivity research point to the need for a qualitatively different framework when considering the entire question. We describe the work that needs…
Descriptors: Productivity, Measurement, Higher Education, Learning
Newman, Ian R.; Gibb, Maia; Thompson, Valerie A. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2017
It is commonly assumed that belief-based reasoning is fast and automatic, whereas rule-based reasoning is slower and more effortful. Dual-Process theories of reasoning rely on this speed-asymmetry explanation to account for a number of reasoning phenomena, such as base-rate neglect and belief-bias. The goal of the current study was to test this…
Descriptors: Logical Thinking, Beliefs, Bias, Problem Solving
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance – Journal of Speech, Language, and Hearing Research, 2017
Purpose: To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Method: Assessment was based on full-day audio…
Descriptors: Automation, Children, Speech Evaluation, Nonprint Media
Roy-Charland, Annie; Colangelo, Gabrielle; Foglia, Victoria; Reguigui, Leïla – Reading and Writing: An Interdisciplinary Journal, 2017
In tests used to measure reading comprehension, validity is important in obtaining accurate results. Unfortunately, studies have shown that people can correctly answer some questions of these tests without reading the related passage. These findings bring forth the need to address whether this phenomenon is observed in multiple-choice only tests…
Descriptors: Standardized Tests, Reading Tests, Reading Comprehension, Test Validity

Peer reviewed
Direct link
