Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Barnett, Elisabeth A.; Reddy, Vikash – Center for the Analysis of Postsecondary Readiness, 2017
Many postsecondary institutions, and community colleges in particular, require that students demonstrate specified levels of literacy and numeracy before taking college-level courses. Typically, students have been assessed using two widely available tests--ACCUPLACER and Compass. However, placement testing practice is beginning to change for three…
Descriptors: Student Placement, College Entrance Examinations, Educational Practices, Computer Assisted Testing
Naemi, Bobby; Seybert, Jacob; Robbins, Steven; Kyllonen, Patrick – ETS Research Report Series, 2014
This report introduces the "WorkFORCE"™ Assessment for Job Fit, a personality assessment utilizing the "FACETS"™ core capability, which is based on innovations in forced-choice assessment and computer adaptive testing. The instrument is derived from the fivefactor model (FFM) of personality and encompasses a broad spectrum of…
Descriptors: Personality Assessment, Personality Traits, Personality Measures, Test Validity
Arslan, Rumiye; Nalinci, Gülbin Zeren – Turkish Online Journal of Educational Technology - TOJET, 2014
The aim of this study is to develop a scale determining the visual literacy levels of university students. After reviewing the relevant literature a 75 item draft scale was prepared. The scale was applied to 3rd and 4th year students of Education Faculty of Amasya University. Non-functional items have been excluded from the scale as a result of…
Descriptors: Higher Education, Visual Literacy, Test Construction, College Students
Buzick, Heather; Stone, Elizabeth – Educational Measurement: Issues and Practice, 2014
Read aloud is a testing accommodation that has been studied by many researchers, and its use on K-12 assessments continues to be debated because of its potential to change the measured construct or unfairly increase test scores. This study is a summary of quantitative research on the read aloud accommodation. Previous studies contributed…
Descriptors: Meta Analysis, Reading Aloud to Others, Educational Research, Statistical Analysis
Michaelides, Michalis P. – Assessment in Education: Principles, Policy & Practice, 2014
Student examinees are key stakeholders in large-scale, high-stakes, public examination systems. How they perceive the purpose, comprehend the technical characteristics of testing and how they interpret scores influence their response to the system demands and their preparation for the examinations; this information relates to intended and…
Descriptors: Foreign Countries, National Competency Tests, High Stakes Tests, Student Attitudes
Wetzel, Angela Payne – ProQuest LLC, 2011
Previous systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. Psychology and general education reviews of factor analysis also indicate gaps between current and best practices; yet, a comprehensive review of exploratory factor analysis in instrument development across…
Descriptors: Medical Education, Scholarship, Writing for Publication, Evidence
Harayama, Nancy Eiko – ProQuest LLC, 2013
The No Child Left Behind Act mandated the development of statewide alternate assessments to measure the academic achievement of students with the most significant cognitive disabilities. The valid assessment of all test takers is critical due to its high-stakes nature and the use of its results to inform instruction. Given the heterogeneity of the…
Descriptors: Educational Assessment, Alternative Assessment, Mental Retardation, High Stakes Tests
Bailey, Heather – Learning and Individual Differences, 2012
Working memory span tasks are popular measures, in part, because performance on these tasks predicts performance on other measures of cognitive ability. The traditional method of span-task administration is the experimenter-paced version, whose reliability and validity have been repeatedly demonstrated. However, computer-paced span tasks are…
Descriptors: Short Term Memory, Pacing, Cognitive Tests, Cognitive Ability
Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William – Measurement in Physical Education and Exercise Science, 2012
In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…
Descriptors: Team Sports, Test Validity, Foreign Countries, Validity
Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016
Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…
Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items
Young, Charles; Campbell, Megan – British Journal of Guidance & Counselling, 2014
This article provides GP-CORE norms for a South African university sample, which are compared to published data obtained from a United Kingdom university sample. The measure appears to be both reliable and valid for this multilingual and multicultural South African sample. The profiles of the psychological distress reported by white South African…
Descriptors: Foreign Countries, Well Being, Comparative Analysis, Psychological Needs
Tarhini, Ali; Hassouna, Mohammad; Abbasi, Muhammad Sharif; Orozco, Jorge – Electronic Journal of e-Learning, 2015
Simpler is better. There are a lot of "needs" in e-Learning, and there's often a limit to the time, talent, and money that can be thrown at them individually. Contemporary pedagogy in technology and engineering disciplines, within the higher education context, champion instructional designs that emphasize peer instruction and rich…
Descriptors: Foreign Countries, Educational Technology, Technology Uses in Education, Technology Integration
Parkes, Kelly A.; Powell, Sean R. – Arts Education Policy Review, 2015
The purpose of this article is to describe and analyze the edTPA, a performance assessment created by the Stanford Center for Assessment, Learning, and Equity (SCALE) and administered by Pearson, Inc., to assess the professional readiness of student teachers. We challenge claims made in support of using this assessment, specifically within the…
Descriptors: Teacher Evaluation, Performance Based Assessment, Student Teacher Evaluation, Evaluation Methods
du Plessis, Santie – Online Submission, 2015
The study objectives were to develop, trial and evaluate a cross-cultural adaptation of the Adaptive Behavior Assessment System-Second Edition Teacher Form (ABAS-II TF) ages 5-21 for use with Indigenous Australian students ages 5-14. This study introduced a multiphase mixed-method design with semi-structured and informal interviews, school…
Descriptors: Foreign Countries, Indigenous Populations, Adjustment (to Environment), Psychological Testing

Peer reviewed
Direct link
