Publication Date
| In 2026 | 5 |
| Since 2025 | 627 |
| Since 2022 (last 5 years) | 2564 |
| Since 2017 (last 10 years) | 5599 |
| Since 2007 (last 20 years) | 9195 |
Descriptor
| Test Validity | 21771 |
| Test Reliability | 10011 |
| Test Construction | 5891 |
| Foreign Countries | 4955 |
| Psychometrics | 2963 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2377 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1723 |
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
Location
| Turkey | 807 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 169 |
| United Kingdom | 160 |
| Netherlands | 159 |
| California | 156 |
| Germany | 153 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2016
How we choose to use a term depends on what we want to do with it. If "validity" is to be used to support a score interpretation, validation would require an analysis of the plausibility of that interpretation. If validity is to be used to support score uses, validation would require an analysis of the appropriateness of the proposed…
Descriptors: Test Validity, Test Interpretation, Test Use, Scores
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Sternod, Latisha; French, Brian – Journal of Psychoeducational Assessment, 2016
The Watson-Glaser™ II Critical Thinking Appraisal (Watson-Glaser II; Watson & Glaser, 2010) is a revised version of the "Watson-Glaser Critical Thinking Appraisal®" (Watson & Glaser, 1994). The Watson-Glaser II introduces a simplified model of critical thinking, consisting of three subdimensions: recognize assumptions, evaluate…
Descriptors: Cognitive Tests, Critical Thinking, Test Construction, Test Reliability
College Board, 2023
Over the past several years, content experts, psychometricians, and researchers have been hard at work developing, refining, and studying the digital SAT. The work is grounded in foundational best practices and advances in measurement and assessment design, with fairness for students informing all of the work done. This paper shares learnings from…
Descriptors: College Entrance Examinations, Psychometrics, Computer Assisted Testing, Best Practices
Al-Jarf, Reima – Online Submission, 2023
This article aims to give a comprehensive guide to planning and designing vocabulary tests, which includes identifying the skills to be covered by the test; outlining the course content covered; preparing a table of specifications that shows the skill, content topics and number of questions allocated to each; and preparing the test instructions. The…
Descriptors: Vocabulary Development, Learning Processes, Test Construction, Course Content
Böttcher, Franziska; Thiel, Felicitas – Higher Education: The International Journal of Higher Education Research, 2018
Several concepts have been developed to implement research-oriented teaching in higher education in the last 15 years. The definition of research competences, however, has received minor attention so far. Some approaches to modeling research competences describe these competences along the research process but either focus on a specific academic…
Descriptors: Undergraduate Students, Graduate Students, Student Evaluation, Research Skills
Deygers, Bart; Van den Branden, Kris; Van Gorp, Koen – Language Testing, 2018
University entrance language tests are often administered under the assumption that even if language proficiency does not determine academic success, a certain proficiency level is still required. Nevertheless, little research has focused on how well L2 students cope with the linguistic demands of their studies in the first months after passing an…
Descriptors: Foreign Countries, College Entrance Examinations, Language Tests, Justice
Sternberg, Robert J. – Educational Psychology Review, 2018
This article reviews four interrelated approaches to reducing an inequitable gap in cognitive and educational test scores between individuals of a dominant culture and individuals of other cultures or subcultures. These approaches include (a) use of broader measures, (b) performance- and project-based assessments, (c) direct measurement of…
Descriptors: Educational Testing, Cognitive Tests, Scores, Cultural Differences
Aypay, Ayse – International Journal of Psychology and Educational Studies, 2018
This study introduces the topics of reward addiction and sensitivity to punishment in academic contexts to the literature. This study was designed firstly to develop reliable and valid measurement tools that can measure high school students' reward addiction and sensitivity to punishment in academic contexts, and secondly to test the structural…
Descriptors: Addictive Behavior, Rewards, Punishment, High School Students
Behizadeh, Nadia; Neely, Adrian – Equity & Excellence in Education, 2018
In this case study, we examine the consequential validity of using edTPA in a social justice-oriented, urban teacher preparation program. According to the developers of edTPA, a primary purpose is to support teacher candidate learning, yet our analysis suggests that edTPA does not support learning when used during student teaching. Our 16…
Descriptors: Performance Based Assessment, Test Validity, Social Justice, Urban Teaching
Brouzos, Andreas; Vassilopoulos, Stephanos P.; Baourda, Vasiliki C. – Journal for Specialists in Group Work, 2018
Studies examining processes in youth group psychotherapy are scarce. This article reports on the development and psychometric properties of the Psychoeducational Group Alliance Scale for Children (PGAS-c). This scale was designed to assess the therapeutic alliance that develops between the members and the facilitator in a psychoeducational group…
Descriptors: Psychotherapy, Test Construction, Psychometrics, Group Counseling
Paget, Michael; Brar, Gurbir; Veale, Pamela; Busche, Kevin; Coderre, Sylvain; Woloschuk, Wayne; McLaughlin, Kevin – Advances in Health Sciences Education, 2018
Prior studies have shown a correlation between the grades students receive and how they rate their teacher in the classroom. In this study, the authors probe this association on clinical rotations and explore potential mechanisms. All In-Training Evaluation Reports (ITERs) for students on mandatory clerkship rotations from April 1, 2013 to January…
Descriptors: Correlation, Student Evaluation of Teacher Performance, Regression (Statistics), Rating Scales
Leder, Gilah C.; Forgasz, Helen J. – ZDM: The International Journal on Mathematics Education, 2018
Assessment in mathematics is assumed to provide credible and important information about what students know and can do. In this paper we focus on large scale tests and question whether mathematics assessment is essentially gender neutral. We consider aspects of test validity and discuss issues of terminology related to gender and mathematics. In…
Descriptors: Mathematics Education, Evaluation Methods, Gender Bias, Test Content
Whittaker, Andrea; Pecheone, Raymond; Stansbury, Kendyll – Education Policy Analysis Archives, 2018
Stanford Center for Assessment, Learning, and Equity (SCALE) provides a commentary on the manuscripts in this special issue, responding to criticisms of edTPA as an assessment that narrows the curriculum, heavily relies on students' academic writing skills, and creates additional burdens for teacher candidates. The commentary highlights how edTPA…
Descriptors: Preservice Teachers, Teacher Evaluation, Preservice Teacher Education, High Stakes Tests
Ayse, Eliüsük Bülbül – Educational Research and Reviews, 2018
Seligman's "well-being scale" PERMA evaluates people's level of well-being in five dimensions: P: Positive and Negative emotions, E: Engagement, R: Relationships, M: Meaning, A: Accomplishment, N: Negative Emotion and H: Health. This scale measures a person's level of well being using five components. The measurement scale developed…
Descriptors: Foreign Countries, Rating Scales, Well Being, Psychological Patterns

