Publication Date
| In 2026 | 2 |
| Since 2025 | 613 |
| Since 2022 (last 5 years) | 2550 |
| Since 2017 (last 10 years) | 5585 |
| Since 2007 (last 20 years) | 9181 |
Descriptor
| Test Validity | 21757 |
| Test Reliability | 10004 |
| Test Construction | 5884 |
| Foreign Countries | 4949 |
| Psychometrics | 2962 |
| Factor Analysis | 2941 |
| Measures (Individuals) | 2373 |
| Higher Education | 2249 |
| Evaluation Methods | 2084 |
| College Students | 1812 |
| Correlation | 1722 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 806 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 171 |
| Spain | 168 |
| United Kingdom | 160 |
| Netherlands | 158 |
| California | 155 |
| Germany | 153 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Carlson, Sarah; Seipel, Ben; McMaster, Kristen – Society for Research on Educational Effectiveness, 2011
Many researchers focus on assessing the cognitive components of reading comprehension. However, researchers are challenged to find the best way to measure the cognitive components of reading comprehension because many reading comprehension assessments differ in terms of format (i.e., cloze, multiple-chose, open-ended); presentation (i.e., print);…
Descriptors: Reading Tests, Reading Comprehension, Psychometrics, Test Reliability
Wei, Xi-Jun; Tong, Kai-yu; Hu, Xiao-ling – International Journal of Rehabilitation Research, 2011
Responsiveness of clinical assessments is an important element in the report of clinical effectiveness after rehabilitation. The correlation could reflect the validity of assessments as an indication of clinical performance before and after interventions. This study investigated the correlation and responsiveness of Fugl-Meyer Assessment (FMA),…
Descriptors: Patients, Measures (Individuals), Program Effectiveness, Correlation
Mier, Constance M. – Research Quarterly for Exercise and Sport, 2011
The accuracy of video analysis of the passive straight-leg raise test (PSLR) and the validity of the sit-and-reach test (SR) were tested in 60 men and women. Computer software measured static hip-joint flexion accurately. High within-session reliability of the PSLR was demonstrated (R greater than 0.97). Test-retest (separate days) reliability for…
Descriptors: Video Technology, Accuracy, Test Validity, Test Reliability
Yun, Sung Hyun; Vonk, M. Elizabeth – Research on Social Work Practice, 2011
The present study demonstrates the development and initial examination of psychometric properties of the Intimate Violence Responsibility Scale (IVRS) in a community-based sample (N = 527). The underlying factor structure of the IVRS was tested by the exploratory factor analysis (Principal Axis Factoring), which identifies the four factors:…
Descriptors: Evidence, Violence, Test Validity, Factor Structure
Reynolds-Keefer, Laura; Johnson, Robert – Practical Assessment, Research & Evaluation, 2011
In developing attitudinal instruments for young children, researchers, program evaluators, and clinicians often use response scales with pictures or images (e.g., smiley faces) as anchors. This article considers connections between word-based and picture based Likert scales and highlights the value in translating conventions used in word-based…
Descriptors: Likert Scales, Questionnaires, Test Validity, Pictorial Stimuli
McGrath, Robert E.; Kim, Brian H.; Hough, Leaetta – Psychological Bulletin, 2011
In their comment, M. L. Rohling et al. (2011) accused us of offering a "misleading" review of response bias. In fact, the additional findings they provided on this topic are relevant only to bias assessment in 1 of the domains we discussed, neuropsychological assessment. Furthermore, we contend that, even in that 1 domain, the additional findings…
Descriptors: Response Style (Tests), Bias, Test Validity, Research Methodology
Kahraman, Nilufer; Thompson, Tony – Journal of Educational Measurement, 2011
A practical concern for many existing tests is that subscore test lengths are too short to provide reliable and meaningful measurement. A possible method of improving the subscale reliability and validity would be to make use of collateral information provided by items from other subscales of the same test. To this end, the purpose of this article…
Descriptors: Test Length, Test Items, Alignment (Education), Models
Akrofi, Solomon; Clarke, Nicholas; Vernon, Guy – International Journal of Training and Development, 2011
Evaluating the returns on intangible assets in general and executive human capital in particular is still a challenging endeavour. One possible means of addressing this challenge involves developing a broad measure of executive learning and development (L&D), encapsulating both the formal and informal activities that closely reflect the dynamic…
Descriptors: Measures (Individuals), Test Validity, Administrators, Business Administration
Cimolin, Veronica; Galli, Manuela; Vimercati, Sara Laura; Albertini, Giorgio – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Gait analysis (GA) is widely used for clinical evaluations and it is recognized as a central element in the quantitative evaluation of gait, in the planning of treatments and in the pre vs. post intervention evaluations in children with Cerebral Palsy (CP). Otherwise, GA produces a large volume of data and there is the clinical need to provide…
Descriptors: Intervention, Outcomes of Treatment, Cerebral Palsy, Children
Wagner, Matthias Oliver; Kastner, Julia; Petermann, Franz; Bos, Klaus – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
The Movement Assessment Battery for Children-2 (M-ABC-2) is one of the most commonly used tests for the diagnosis of specific developmental disorders of motor function (F82). The M-ABC-2 comprises eight subtests per age band (AB) that are assigned to three dimensions: manual dexterity, aiming and catching, and balance. However, while previous…
Descriptors: Test Validity, Diagnostic Tests, Psychomotor Skills, Developmental Disabilities
Ligtvoet, Rudy; van der Ark, L. Andries; Bergsma, Wicher P.; Sijtsma, Klaas – Psychometrika, 2011
We propose three latent scales within the framework of nonparametric item response theory for polytomously scored items. Latent scales are models that imply an invariant item ordering, meaning that the order of the items is the same for each measurement value on the latent scale. This ordering property may be important in, for example,…
Descriptors: Intelligence Tests, Measures (Individuals), Methods, Item Response Theory
Crosby, James W. – Journal of Psychoeducational Assessment, 2011
The "Social Skills Improvement System" (SSIS; Gresham & Elliot, 2008) is designed to assist in the screening and classification of students (ages 8 to 18) who are suspected of having significant social skills deficits, and to offer support in the development of interventions for those found to display significant social skills…
Descriptors: Interpersonal Competence, Improvement, Rating Scales, Intervention
Tieso, Carol L.; Hutcheson, Virginia H. – Planning and Changing, 2014
The authors of this article review the development and discuss potential uses for a new instrument that evolved from follow-up research conducted after completion of a five-year study of innovative curricular and instructional practices. The instrument is A Stakeholder's Perceptions of Innovative Reform Efforts (ASPIRE). The primary purpose of…
Descriptors: Probability, Educational Change, Success, Teacher Surveys
Alberola Colomar, María Pilar – Language Learning in Higher Education, 2014
This article presents and analyses a classroom-based assessment method to test students' speaking skills in a variety of professional settings in tourism. The assessment system has been implemented in the Communication in English for Tourism course, as part of the Tourism Management degree programme, at Florida Universitaria (affiliated to the…
Descriptors: English for Special Purposes, Tourism, Oral Language, Language Tests
Zandi, Hamed; Kaivanpanah, Shiva; Alavi, Seyed Mohammad – Iranian Journal of Language Teaching Research, 2014
Reviewing the test specifications to improve the quality of language tests may be a routine process in professional testing systems. However, there is a paucity of research about the effect of specifications review on improving the quality of small-scale tests. The purpose of the present study was twofold: how specifications review could help…
Descriptors: Test Reliability, Test Validity, Language Tests, Test Items

Peer reviewed
Direct link
