Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Berkhout, Louise; Hoekman, Joop; Goorhuis-Brouwer, Sieneke M. – Early Child Development and Care, 2012
The objective of this study was to develop an instrument to observe the play behaviour of a whole group of children from four to six years of age in a classroom setting on the basis of video recording. The instrument was developed in collaboration with experienced teachers and experts on play. Categories of play were derived from the literature…
Descriptors: Observation, Video Technology, Play, Test Construction
Ruscio, John; Seaman, Florence; D'Oriano, Carianne; Stremlo, Elena; Mahalchik, Krista – Measurement: Interdisciplinary Research and Perspectives, 2012
Scholarly impact is studied frequently and used to make consequential decisions (e.g., hiring, tenure, promotion, research support, professional honors), and therefore it is important to measure it accurately. Developments in information technology and statistical methods provide promising new metrics to complement traditional information sources…
Descriptors: Citation Indexes, Citation Analysis, Outcome Measures, Scholarship
Mehra, Vandana; Omidian, Faranak – Turkish Online Journal of Distance Education, 2012
The study of student's attitude towards e-learning can in many ways help managers better prepare in light of e-learning for the future. This article describes the process of the development of an instrument to measure university students' attitude towards e-learning. The scale was administered to 200 University students from two countries (India…
Descriptors: Foreign Countries, Electronic Learning, College Students, Student Attitudes
Yin, Yue – Educational Assessment, 2012
This study examines the potential of the tree diagram, a type of graphic organizer, as an assessment tool to measure students' knowledge structures in statistics education. Students' knowledge structures in statistics have not been sufficiently assessed in statistics, despite their importance. This article first presents the rationale and method…
Descriptors: Statistics, Mathematics Education, Instructional Materials, Visual Aids
Lundgren, Tobias; Luoma, Jason B.; Dahl, JoAnne; Strosahl, Kirk; Melin, Lennart – Cognitive and Behavioral Practice, 2012
Two studies were conducted to develop and evaluate an instrument intended to identify and measure personal values, values attainment, and persistence in the face of barriers. Study 1 describes a content validity approach to the construction and preliminary validation of the Bull's Eye Values Survey (BEVS), using a sample of institutionalized…
Descriptors: Psychometrics, Identification, Measures (Individuals), Investigations
Runnqvist, Elin; Costa, Albert – Bilingualism: Language and Cognition, 2012
Levy, Mc Veigh, Marful and Andreson (2007) found that naming pictures in L2 impaired subsequent recall of the L1 translation words. This was interpreted as evidence for a domain-general inhibitory mechanism (RIF) underlying first language attrition. Because this result is at odds with some previous findings and theoretical assumptions, we wanted…
Descriptors: Language Skill Attrition, Language Dominance, Memory, Bilingualism
Barth, Amy E.; Stuebing, Karla K.; Fletcher, Jack M.; Cirino, Paul T.; Romain, Melissa; Francis, David; Vaughn, Sharon – Reading Psychology, 2012
We evaluated the reliability and validity of two oral reading fluency scores for 1-minute equated passages: median score and mean score. These scores were calculated from measures of reading fluency administered up to five times over the school year to students in grades six to eight (n = 1,317). Both scores were highly reliable with strong…
Descriptors: Reading Fluency, Test Validity, Test Reliability, Scores
Steedle, Jeffrey T. – Assessment & Evaluation in Higher Education, 2012
Value-added scores from tests of college learning indicate how score gains compare to those expected from students of similar entering academic ability. Unfortunately, the choice of value-added model can impact results, and this makes it difficult to determine which results to trust. The research presented here demonstrates how value-added models…
Descriptors: College Outcomes Assessment, Postsecondary Education, Achievement Tests, Models
Wakita, Takafumi; Ueshima, Natsumi; Noguchi, Hiroyuki – Educational and Psychological Measurement, 2012
This study examined whether the number of options in the Likert scale influences the psychological distance between categories. The most important assumption when using the Likert scale is that the psychological distance between options is equal. The authors proposed a new algorithm for calculating the scale values of options by applying item…
Descriptors: Likert Scales, Test Items, Personality Measures, Item Response Theory
Yap, Melvin J.; Balota, David A.; Sibley, Daragh E.; Ratcliff, Roger – Journal of Experimental Psychology: Human Perception and Performance, 2012
Empirical work and models of visual word recognition have traditionally focused on group-level performance. Despite the emphasis on the prototypical reader, there is clear evidence that variation in reading skill modulates word recognition performance. In the present study, we examined differences among individuals who contributed to the English…
Descriptors: Evidence, Reaction Time, Word Recognition, Dictionaries
The Strengths Assessment Inventory: Reliability of a New Measure of Psychosocial Strengths for Youth
Brazeau, James N.; Teatero, Missy L.; Rawana, Edward P.; Brownlee, Keith; Blanchette, Loretta R. – Journal of Child and Family Studies, 2012
A new measure, the Strengths Assessment Inventory-Youth self-report (SAI-Y), was recently developed to assess the strengths of children and adolescents between the ages of 10 and 18 years. The SAI-Y differs from similar measures in that it provides a comprehensive assessment of strengths that are intrinsic to the individual as well as strengths…
Descriptors: Error of Measurement, Psychometrics, Secondary School Students, Adolescents
Nehm, Ross H.; Haertig, Hendrik – Journal of Science Education and Technology, 2012
Our study examines the efficacy of Computer Assisted Scoring (CAS) of open-response text relative to expert human scoring within the complex domain of evolutionary biology. Specifically, we explored whether CAS can diagnose the explanatory elements (or Key Concepts) that comprise undergraduate students' explanatory models of natural selection with…
Descriptors: Evolution, Undergraduate Students, Interrater Reliability, Computers
Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012
This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…
Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models
Bertelli, Marco; Scuticchio, Daniela; Ferrandi, Angela, Lassi, Stefano; Mango, Francesco; Ciavatta, Claudio; Porcelli, Cesare; Bianco, Annamaria; Monchieri, Sergio – Research in Developmental Disabilities: A Multidisciplinary Journal, 2012
SPAID (Psychiatric Instrument for the Intellectually Disabled Adult) is the first Italian tool-package for carrying out psychiatric diagnosis in adults with Intellectual Disabilities (ID). It includes the "G" form, for general diagnostic orientation, and specific checklists for all groups of syndromes stated by the available…
Descriptors: Personality Problems, Self Control, Mental Retardation, Autism
Black, David S.; Sussman, Steve; Johnson, C. Anderson; Milam, Joel – Assessment, 2012
The Mindful Attention Awareness Scale (MAAS) has the longest empirical track record as a valid measure of trait mindfulness. Most of what is understood about trait mindfulness comes from administering the MAAS to relatively homogenous samples of Caucasian adults. This study rigorously evaluates the psychometric properties of the MAAS among Chinese…
Descriptors: Adolescents, High School Students, Measures (Individuals), Psychometrics

Peer reviewed
Direct link
