Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Jordan, Jeremy S.; Turner, Brian A. – Measurement in Physical Education and Exercise Science, 2008
Researchers in a number of disciplines have examined the utility of single-item measures for both affective and cognitive constructs. While these authors have indicated that, under certain circumstances, the use of single-item measures is appropriate, there remains concern regarding the reliability and validity of single-item measures. This study…
Descriptors: Job Satisfaction, Test Reliability, Test Validity, Measures (Individuals)
Gelade, Garry A. – Intelligence, 2008
This paper examines the distribution of national IQ in geographical space. When the heritability of IQ and its dependence on eco-social factors are considered from a global perspective, they suggest that the IQs of neighboring countries should be similar. Using previously published IQ data for 113 nations (Lynn, R., & Vanhanen, T., (2006). IQ and…
Descriptors: Global Approach, Intelligence Quotient, Geographic Location, Socioeconomic Influences
Vigneau, Francois; Bors, Douglas A. – Intelligence, 2008
Various taxonomies of Raven's Advanced Progressive Matrices (APM) items have been proposed in the literature to account for performance on the test. In the present article, three such taxonomies based on information processing, namely Carpenter, Just and Shell's [Carpenter, P.A., Just, M.A., & Shell, P., (1990). What one intelligence test…
Descriptors: Intelligence, Intelligence Tests, Factor Analysis, Classification
Lannie, Amanda L.; Martens, Brian K. – Journal of Behavioral Education, 2008
Four fifth-grade students were presented with frustration-level math probes while three performance dimensions were measured (i.e., percent intervals on-task, percent correct digits, and digits correct per minute (DCM)). Using a multiple baseline design across participants, students were trained to self-monitor time on-task, accuracy, and…
Descriptors: Intervals, Interrater Reliability, Rewards, Grade 5
Henson, Robert; Roussos, Louis; Douglas, Jeff; He, Xuming – Applied Psychological Measurement, 2008
Cognitive diagnostic models (CDMs) model the probability of correctly answering an item as a function of an examinee's attribute mastery pattern. Because estimation of the mastery pattern involves more than a continuous measure of ability, reliability concepts introduced by classical test theory and item response theory do not apply. The cognitive…
Descriptors: Diagnostic Tests, Classification, Probability, Item Response Theory
Heh, Peter – ProQuest LLC, 2009
The current study examined the validation and alignment of the PASA-Science by determining whether the alternate science assessment anchors linked to the regular education science anchors; whether the PASA-Science assessment items are science; whether the PASA-Science assessment items linked to the alternate science eligible content, and what…
Descriptors: Program Effectiveness, Special Education, Science Education, Science Tests
Setzer, J. Carl; He, Yi – GED Testing Service, 2009
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability
Carr, W. David; Frey, Bruce B.; Swann, Elizabeth – Athletic Training Education Journal, 2009
Objective: To establish the validity and reliability of an online assessment instrument's items developed to track educational outcomes over time. Design and Setting: A descriptive study of the validation arguments and reliability testing of the assessment items. The instrument is available to graduating students enrolled in entry-level Athletic…
Descriptors: Athletics, Educational Objectives, Outcomes of Education, Validity
Petersen, George J.; Kelly, Victoria L.; Reimer, Catherine N.; Mosunich, Daniel; Thompson, Debra – Journal of School Public Relations, 2009
This study explored the perspectives of 350 California superintendents from various-sized school districts in relation to their ability to support student learning while addressing the numerous and complex personnel, social, and economic challenges faced by schools. Specifically, this study investigated the attitudes and opinions of district…
Descriptors: Superintendents, Administrator Attitudes, Social Influences, Barriers
Ploegh, Karin; Tillema, Harm H.; Segers, Mien S. R. – Studies in Educational Evaluation, 2009
With the increasing popularity of peer assessment as an assessment tool, questions may arise about its measurement quality. Among such questions, the extent peer assessment practices adhere to standards of measurement. It has been claimed that new forms of assessment, require new criteria to judge their validity and reliability, since they aim for…
Descriptors: Peer Evaluation, Measurement, Summative Evaluation, Formative Evaluation
Murphy, Timothy; MacLaren, Iain; Flynn, Sharon – International Journal of Teaching and Learning in Higher Education, 2009
This study examines various aspects of an effective teaching evaluation system. In particular, reference is made to the potential of Fink's (2008) four main dimensions of teaching as a summative evaluation model for effective teaching and learning. It is argued that these dimensions can be readily accommodated in a Teaching Portfolio process. The…
Descriptors: Portfolios (Background Materials), College Faculty, Teacher Effectiveness, Summative Evaluation
Kaminski, Jennifer Wyatt; David-Ferdon, Corinne; Battistich, Victor A. – Journal of Research in Character Education, 2009
The Social and Character Development (SACD) research program was designed to evaluate the effectiveness of seven elementary-school-based programs developed to promote social and emotional competence, positive behavior, a positive school climate, and academic achievement, and to decrease negative behavior. Procedures undertaken by the SACD…
Descriptors: Emotional Intelligence, Academic Achievement, Factor Structure, Personality
Hitt, Austin M.; Helms, Emory C. – Professional Educator, 2009
This paper discusses an instructional approach designed to help preservice teachers understand how assessments can be influenced by personal biases. In order to achieve this objective, we developed an analogy-based activity called "The Dog Show Analogy." After participating in the activity, we have observed that the participating preservice…
Descriptors: Preservice Teachers, Student Evaluation, Teacher Education Programs, Experimenter Characteristics
Yaman, Erkan – Educational Sciences: Theory and Practice, 2009
The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Descriptors: Test Construction, Barriers, Correlation, Work Environment
Aouad, Julie; Savage, Robert – Canadian Journal of School Psychology, 2009
The simple view of reading (SVR) provides a conceptual framework for describing the processes involved when readers comprehend text. Strong evidence for the SVR comes from factor-analytic studies showing dissociations between decoding and comprehension skills. The aim of the present study is to investigate whether predecoding and comprehension…
Descriptors: Listening Comprehension, Early Intervention, Psychologists, School Psychologists

Peer reviewed
Direct link
