Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010
P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…
Descriptors: Research Methodology, Guidelines, Probability, Computation
Development and Psychometric Evaluation of the Yale-Brown Obsessive-Compulsive Scale--Second Edition
Storch, Eric A.; Rasmussen, Steven A.; Price, Lawrence H.; Larson, Michael J.; Murphy, Tanya K.; Goodman, Wayne K. – Psychological Assessment, 2010
The Yale-Brown Obsessive-Compulsive Scale (Y-BOCS; Goodman, Price, Rasmussen, Mazure, Delgado, et al., 1989) is acknowledged as the gold standard measure of obsessive-compulsive disorder (OCD) symptom severity. A number of areas where the Y-BOCS may benefit from revision have emerged in past psychometric studies of the Severity Scale and Symptom…
Descriptors: Check Lists, Construct Validity, Validity, Measures (Individuals)
McBride, James R.; Ysseldyke, Jim; Milone, Michael; Stickney, Eric – Canadian Journal of School Psychology, 2010
Technical adequacy and information/cost return were examined for four early reading measures: the Dynamic Indicators of Basic Early Literacy Skills (DIBELS), STAR Early Literacy (SEL), Group Reading Assessment and Diagnostic Evaluation (GRADE), and the Texas Primary Reading Inventory (TPRI). All four assessments were administered to the same…
Descriptors: Early Reading, Reading Achievement, Adaptive Testing, Phonemic Awareness
Montecinos, Carmen; Rittershaussen, Sylvia; Solis, Maria Cristina; Contreras, Ines; Contreras, Claudia – Asia-Pacific Journal of Teacher Education, 2010
The instrument Samples of Teaching Performance (STP) was developed to assess student teachers' capacity to plan, deliver and evaluate a unit of instruction. The current study reports consequential validity data collected from supervisors (n = 20) and student teachers (n = 62) from three elementary and five secondary teacher preparation programs…
Descriptors: Student Teaching, Student Teachers, Performance Based Assessment, Supervision
Wolfe, Edward W.; Matthews, Staci; Vickers, Daisy – Journal of Technology, Learning, and Assessment, 2010
This study examined the influence of rater training and scoring context on training time, scoring time, qualifying rate, quality of ratings, and rater perceptions. One hundred twenty raters participated in the study and experienced one of three training contexts: (a) online training in a distributed scoring context, (b) online training in a…
Descriptors: Writing Evaluation, Writing Tests, Qualifications, Program Effectiveness
Branscum, Paul; Sharma, Manoj; Kaye, Gail; Succop, Paul – Journal of Nutrition Education and Behavior, 2010
Objective: The objective of this study was to report the construct validity and internal consistency reliability of the Food Behavior Checklist modified for children (FBC-MC), with low-income, Youth Expanded Food and Nutrition Education Program (EFNEP)-eligible children. Methods: Using a cross-sectional research design, construct validity was…
Descriptors: Check Lists, Research Design, Nutrition, Construct Validity
Kapci, Emine Gul; Kucuker, Sevgi; Uslu, Runa I. – Topics in Early Childhood Special Education, 2010
The majority of eligible children cannot access early intervention services in Turkey, often because they are not assessed. The authors adapted the "Ages and Stages Questionnaires" (ASQ) for Turkish children ages 3 to 72 months. Study participants consisted of 375 children who were classified as at risk for developmental delays, 564…
Descriptors: Early Intervention, Eligibility, Classification, Foreign Countries
Wu, Pei-Chen; Huang, Tsai-Wei – Measurement and Evaluation in Counseling and Development, 2010
This study was to apply the mixed Rasch model to investigate person heterogeneity of Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…
Descriptors: Construct Validity, Validity, Depression (Psychology), Item Response Theory
Joseph, Dana L.; Newman, Daniel A. – Educational and Psychological Measurement, 2010
A major stumbling block for emotional intelligence (EI) research has been the lack of adequate evidence for discriminant validity. In a sample of 280 dyads, self- and peer-reports of EI and Big Five personality traits were used to confirm an a priori four-factor model for the Wong and Law Emotional Intelligence Scale (WLEIS) and a five-factor…
Descriptors: Emotional Intelligence, Measurement Techniques, Validity, Personality Traits
Lew, Magdeleine D. N.; Alwis, W. A. M.; Schmidt, Henk G. – Assessment & Evaluation in Higher Education, 2010
The purpose of the two studies presented here was to evaluate the accuracy of students' self-assessment ability, to examine whether this ability improves over time and to investigate whether self-assessment is more accurate if students believe that it contributes to improving learning. To that end, the accuracy of the self-assessments of 3588…
Descriptors: Self Evaluation (Individuals), Beliefs, Learning Processes, Correlation
Ricketts, Chris; Brice, Julie; Coombes, Lee – Advances in Health Sciences Education, 2010
The purpose of multiple choice tests of medical knowledge is to estimate as accurately as possible a candidate's level of knowledge. However, concern is sometimes expressed that multiple choice tests may also discriminate in undesirable and irrelevant ways, such as between minority ethnic groups or by sex of candidates. There is little literature…
Descriptors: Medical Students, Testing Accommodations, Ethnic Groups, Learning Disabilities
Rupp, Andre A.; Gushta, Matthew; Mislevy, Robert J.; Shaffer, David Williamson – Journal of Technology, Learning, and Assessment, 2010
We are currently at an exciting juncture in developing effective means for assessing so-called 21st-century skills in an innovative yet reliable fashion. One of these avenues leads through the world of "epistemic games" (Shaffer, 2006a), which are games designed to give learners the rich experience of professional practica within a discipline.…
Descriptors: Research Methodology, Educational Research, Evaluation Methods, Educational Games
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Chiat, Shula; Roy, Penny – Journal of Speech, Language, and Hearing Research, 2007
Purpose: To determine the psychometric properties of the Preschool Repetition (PSRep) Test (Roy & Chiat, 2004), to establish the range of performance in typically developing children and variables affecting this performance, and to compare the performance of clinically referred children. Method: The PSRep Test comprises 18 words and 18…
Descriptors: Phonology, Psychometrics, Interrater Reliability, Followup Studies
Curcic, Svjetlana; Johnstone, Robin S. – Computers in the Schools, 2016
This study examined the effects of an intervention in writing with digital interactive books. To improve the writing skills of seventh- and eighth-grade students with a learning disability in reading, we conducted a quasi-experimental study in which the students read interactive digital books (i-books), took notes, wrote summaries, and acted as…
Descriptors: Intervention, Writing Skills, Learning Disabilities, Cartoons

Peer reviewed
Direct link
