Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Eyler, Amy A.; Brownson, Ross C.; Aytur, Semra A.; Cradock, Angie L.; Doescher, Mark; Evenson, Kelly R.; Kerr, Jacqueline; Maddock, Jay; Pluto, Delores L.; Steinman, Lesley; Tompkins, Nancy O'Hara; Troped, Philip; Schmid, Thomas L. – Journal of School Health, 2010
Objectives: To develop a comprehensive inventory of state physical education (PE) legislation, examine trends in bill introduction, and compare bill factors. Methods: State PE legislation from January 2001 to July 2007 was identified using a legislative database. Analysis included components of evidence-based school PE from the Community Guide and…
Descriptors: Physical Education, Teacher Certification, Content Analysis, Trend Analysis
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Pesman, Haki; Eryilmaz, Ali – Journal of Educational Research, 2010
The authors aimed to propose a valid and reliable diagnostic instrument by developing a three-tier test on simple electric circuits. Based on findings from the interviews, open-ended questions, and the related literature, the test was developed and administered to 124 high school students. In addition to some qualitative techniques for…
Descriptors: Misconceptions, Diagnostic Tests, Psychometrics, Physics
Topcu, Mustafa Sami – Evaluation & Research in Education, 2010
This study aimed to develop and validate the Attitudes towards Socioscientific Issues Scale (ATSIS) for undergraduate students. In the first step, data were collected from 160 undergraduate students from the departments of science education and elementary education to provide validity of the scale. In light of the results of an exploratory factor…
Descriptors: Science and Society, Attitude Measures, Student Attitudes, Undergraduate Students
Kocakulah, Mustafa Sabri – Journal of Science Education and Technology, 2010
This study aims to develop and apply a rubric to evaluate the solutions of pre-service primary science teachers to questions about Newton's Laws of Motion. Two groups were taught the topic using the same teaching methods and administered four questions before and after teaching. Furthermore, 76 students in the experiment group were instructed…
Descriptors: Control Groups, Scientific Concepts, Academic Achievement, Motion
Neuman, S.B.; Koh, S.; Dwyer, J. – Early Childhood Research Quarterly, 2008
The purpose of this study was to develop a valid and reliable tool for measuring the quality of the language and literacy environment in home-based settings. Based on a convergence of research on the ecological and psychological factors associated with early literacy development, the Child/Home Environmental Language and Literacy Observation…
Descriptors: Observation, Interrater Reliability, Urban Areas, Psychometrics
Gray, K. M.; Tonge, B. J.; Sweeney, D. J.; Einfeld, S. L. – Journal of Autism and Developmental Disorders, 2008
The ability to identify children who require specialist assessment for the possibility of autism at as early an age as possible has become a growing area of research. A number of measures have been developed as potential screening tools for autism. The reliability and validity of one of these measures for screening for autism in young children…
Descriptors: Check Lists, Autism, Interrater Reliability, Young Children
Milanowski, Anthony T.; Heneman, Herbert G., III; Kimball, Steven M. – Wisconsin Center for Education Research (NJ1), 2011
This paper reports on a study of the current state of the art in teaching assessment. The major goal of the study was to examine a sample of assessment systems and then develop a specification for a state-of-the-art performance assessment system to be used for human capital management functions. The authors' hope was that this specification would…
Descriptors: Human Capital, Management Systems, Formative Evaluation, Performance Based Assessment
Chi, Youngshin – ProQuest LLC, 2011
This study investigated the breakdown effect of a listening comprehension test, that is, whether impediments affect test takers' comprehension of lectures. It also collected data on test takers' cognitive awareness of test tasks that contain listening breakdown factors and on how they perceived these impediments. In the context of this study, a "Breakdown" is a test…
Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages
Coe, Michael; Hanita, Makoto; Nishioka, Vicki; Smiley, Richard – National Center for Education Evaluation and Regional Assistance, 2011
The 6+1 Trait® Writing model (Culham 2003) emphasizes writing instruction in which teachers and students analyze writing using a set of characteristics, or "traits," of written work: ideas, organization, voice, word choice, sentence fluency, conventions, and presentation. The Ideas trait includes the main content and message, including…
Descriptors: Models, Writing Instruction, Instructional Effectiveness, Grade 5
Potemski, Amy; Rowland, Cortney; Witham, Peter – Center for Educator Compensation Reform, 2011
A significant number of educator compensation reform efforts are under way throughout the country. These school-, district-, and state-level programs come in all shapes and sizes--some are small and focus only on a cohort of teachers or schools, whereas others are large and target entire districts or groups of districts. The structure of these…
Descriptors: Program Effectiveness, Educational Change, Rewards, Program Evaluation
Hirao, Katsura – ProQuest LLC, 2011
A self-report assessment scale of school connectedness was validated in this study based on the data from middle-school children in a northeastern state of the United States (n = 145). The scale was based on the School Bonding Model (Morita, 1991), which was derived reductively from the social control (bond) theory (Hirschi, 1969). This validation…
Descriptors: Grade 8, Peer Acceptance, African American Children, Validity
Saricoban, Arif – Hacettepe University Journal of Education, 2011
In this article, the researcher examines the current situation in (a) constructing (designing, structuring, and developing), (b) administering, and (c) assessing foreign language tests to see whether practice is still at the same traditional point, and offers some suggestions on this indispensable issue. To collect the necessary data the 4th year…
Descriptors: Second Language Instruction, Language Tests, Second Language Learning, Language Skills
Unlu, Huseyin – Educational Sciences: Theory and Practice, 2011
This study aimed to develop a Likert-type attitude scale for the profession of physical education teaching (ASPPET). The study group consisted of 556 pre-service physical education teachers. To determine the structural validity of ASPPET, exploratory and confirmatory factor analyses were performed. A…
Descriptors: Physical Education, Factor Structure, Measures (Individuals), Factor Analysis
Norton, Anderson; McCloskey, Andrea; Hudson, Rick A. – Journal of Mathematics Teacher Education, 2011
In order to evaluate the effectiveness of an experimental elementary mathematics field experience course, we have designed a new assessment instrument. These video-based prediction assessments engage prospective teachers in a video analysis of a child solving mathematical tasks. The prospective teachers build a model of that child's mathematics…
Descriptors: Video Technology, Interrater Reliability, Prediction, Knowledge Base for Teaching

