Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Loakes, Deborah; Moses, Karin; Simpson, Jane; Wigglesworth, Gillian – Language Assessment Quarterly, 2012
This article reports on the development and piloting of a vocabulary recognition test designed for Indigenous Australian children. The research is both application oriented and development oriented. The aims of the article are to determine how well the test is used as a test instrument and the extent to which children recognize vocabulary items in…
Descriptors: Language Tests, Foreign Countries, Language Skills, Word Recognition
Hayward, Elizabeth O. – Cultural Studies of Science Education, 2012
In this paper I explore how Margaret Beier, Lesley Miller, and Shu Wang make claims for the validity and reliability of the instrument they developed to explore the construct of "possible selves" as described in their manuscript, "Science Games and the Development of Scientific Possible Selves."
Descriptors: Self Concept Measures, Measurement Techniques, Test Construction, Test Validity
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Geisinger, Kurt F. – International Journal of Testing, 2012
This article sets the stage for the description of a variety of approaches to test reviewing worldwide. It describes the importance of test reviewing as a protection of the public and of society and also the benefits of this activity for test users, who must choose measures to use in particular situations with particular clients at a particular…
Descriptors: Test Reviews, Evaluation Methods, Evaluation Criteria, Global Approach
Fang, Jiqian; Power, Mick; Lin, Yueqing; Zhang, Jinxin; Hao, Yuantao; Chatterji, Somnath – Gerontologist, 2012
Purpose of the study: To explore short-form versions of World Health Organization Quality of Life (WHOQOL-OLD) with acceptable psychometric properties, which was developed for older adults by the WHOQOL research group, containing 24 items initially. Design and Methods: We randomly sampled two-thirds of respondents from the data of WHOQOL-OLD field…
Descriptors: Quality of Life, Test Reliability, Correlation, Psychometrics
Gutierrez, Xavier – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquee, 2012
Implicit and explicit knowledge of the second language (L2) are two central constructs in the field of second language acquisition (SLA). In recent years, there has been a renewed interest in obtaining valid and reliable measures of L2 learners' implicit and explicit knowledge (e.g., Bowles, 2011; R. Ellis, 2005). The purpose of the present study…
Descriptors: Grammar, Second Language Learning, Spanish, Language Research
Yurdakul, Isil Kabakci; Odabasi, Hatice Ferhan; Kilicer, Kerem; Coklar, Ahmet Naci; Birinci, Gurkay; Kurt, Adile Askim – Computers & Education, 2012
The purpose of this study is to develop a TPACK (technological pedagogical content knowledge) scale based on the centered component of TPACK framework in order to measure preservice teachers' TPACK. A systematic and step-by-step approach was followed for the development of the scale. The validity and reliability studies of the scale were carried…
Descriptors: Foreign Countries, Preservice Teachers, Test Validity, Test Reliability
Kempf, Emily J.; Schwartzman, Roy; Wilson, Erin; Henry, Kelly Bouas – Journal of Applied Learning in Higher Education, 2011
An educational environment characterized by shrinking fiscal and physical resources has spurred many institutions to undertake comprehensive academic program review. Such a process assesses the performance of programs and prioritizes them for de-emphasis, maintenance, or enhancement. How can applied learning be properly acknowledged within program…
Descriptors: Experiential Learning, Program Evaluation, College Programs, Evaluation Criteria
Winchell, Brooke – ProQuest LLC, 2011
The purpose of the study was to (a) examine the psychometric properties of The Assessment, Evaluation, and Programming System for Infants and Children (AEPS Test); (b) provide a process for establishing psychometric properties for other Curriculum Based Assessments (CBAs); and (c) identify and guide evaluation and subsequent revisions of the AEPS…
Descriptors: Curriculum Based Assessment, Psychometrics, Item Response Theory, Test Theory
Brooks, D. Christopher; Marsh, Lauren; Wilcox, Kimerly; Cohen, Brad – Journal of Faculty Development, 2011
In response to the well-documented need for rigorous evaluations of faculty development programs and increasing demands for institutional accountability, University of Minnesota's Office of Information Technology (OIT) researchers have developed an approach to program evaluation that assesses individual level changes to participants' attitudes,…
Descriptors: Program Evaluation, Information Technology, Faculty Development, Accountability
Rivera, Jennifer E. – Career and Technical Education Research, 2011
The State of New York Agriculture Science Education secondary program is required to have a certification exam for students to assess their agriculture science education experience as a Regent's requirement towards graduation. This paper focuses on the procedure used to develop and validate two content sub-test questions within a…
Descriptors: Test Items, Item Banks, Test Construction, Test Validity
Freeman, Jennifer; Flessner, Christopher A.; Garcia, Abbe – Journal of Abnormal Child Psychology, 2011
The Children's Yale-Brown Obsessive Compulsive Scale (CY-BOCS) is the instrument of choice for assessing symptom severity in older children (i.e., 8-18 years) diagnosed with obsessive-compulsive disorder (OCD). The reliability and validity of this measure for use among younger children (i.e., 5-8 years of age), however, has never been examined.…
Descriptors: Test Reliability, Test Validity, Young Children, Measures (Individuals)
Schermer, Julie Aitken; MacDougall, Robyn – Journal of Career Assessment, 2011
The Jackson Career Explorer (JCE) is a short form and continuous version of the Jackson Vocational Interest Survey (JVIS). The 34 scales of the JCE were investigated in relation to the Career Directions Inventory (CDI). Participants (N = 282) aged 14-57 years were volunteers from local high schools and colleges and completed both measures. The…
Descriptors: Interest Inventories, Vocational Interests, Test Reliability, Test Validity
Matza, Louis S.; Van Brunt, David L.; Cates, Charlotte; Murray, Lindsey T. – Journal of Attention Disorders, 2011
Aims: Childhood attention-deficit/hyperactivity disorder (ADHD) frequently persists into adulthood and continues to impair health-related quality of life (HRQL). Thus, it is important to have validated symptom and HRQL measures for assessing treatment outcomes in this population. The purpose of the current analysis was to assess test-retest…
Descriptors: Attention Deficit Hyperactivity Disorder, Quality of Life, Test Reliability, Patients
Zysberg, Leehu; Levy, Anat; Zisberg, Anna – Journal of Psychoeducational Assessment, 2011
Two studies describe the development of the Audiovisual Test of Emotional Intelligence (AVEI), aimed at candidate selection in educational settings. Study I depicts the construction of the test and the preliminary examination of its psychometric properties in a sample of 92 college students. Item analysis allowed the modification of problem items,…
Descriptors: Emotional Intelligence, Intelligence Tests, Admission Criteria, Test Construction

