Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Wiliam, Dylan – Assessment in Education: Principles, Policy & Practice, 2008
While international comparisons such as those provided by PISA may be meaningful in terms of overall judgements about the performance of educational systems, caution is needed in terms of more fine-grained judgements. In particular it is argued that the results of PISA to draw conclusions about the quality of instruction in different systems is…
Descriptors: Test Bias, Test Construction, Comparative Testing, Evaluation
Jaswal, Vikram K.; McKercher, David A.; VanderBorght, Mieke – Child Development, 2008
Two studies investigated 3- to 5-year-olds' trust in a reliable informant when judging novel labels and novel plural and past tense forms. In Study 1, children (N = 24) endorsed the names of new objects given by an informant who had earlier labeled familiar objects correctly over the names given by an informant who had labeled the same objects…
Descriptors: Nouns, Morphemes, Young Children, Trust (Psychology)
Milanowski, Anthony T.; Heneman, Herbert G., III; Kimball, Steven M. – Wisconsin Center for Education Research (NJ1), 2011
This paper reports on a study of the current state of the art in teaching assessment. The major goal of the study was to examine a sample of assessment systems and then develop a specification for a state-of the art performance assessment system to be used for human capital management functions. The authors hope was that this specification would…
Descriptors: Human Capital, Management Systems, Formative Evaluation, Performance Based Assessment
Chi, Youngshin – ProQuest LLC, 2011
This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…
Descriptors: Generalizability Theory, Listening Comprehension, Intervals, Second Languages
Coe, Michael; Hanita, Makoto; Nishioka, Vicki; Smiley, Richard – National Center for Education Evaluation and Regional Assistance, 2011
The 6+1 Trait[R] Writing model (Culham 2003) emphasizes writing instruction in which teachers and students analyze writing using a set of characteristics, or "traits," of written work: ideas, organization, voice, word choice, sentence fluency, conventions, and presentation. The Ideas trait includes the main content and message, including…
Descriptors: Models, Writing Instruction, Instructional Effectiveness, Grade 5
Potemski, Amy; Rowland, Cortney; Witham, Peter – Center for Educator Compensation Reform, 2011
A significant number of educator compensation reform efforts are under way throughout the country. These school-, district-, and state-level programs come in all shapes and sizes--some are small and focus only on a cohort of teachers or schools, whereas others are large and target entire districts or groups of districts. The structure of these…
Descriptors: Program Effectiveness, Educational Change, Rewards, Program Evaluation
Hirao, Katsura – ProQuest LLC, 2011
A self-report assessment scale of school connectedness was validated in this study based on the data from middle-school children in a northeastern state of the United States (n = 145). The scale was based on the School Bonding Model (Morita, 1991), which was derived reductively from the social control (bond) theory (Hirschi, 1969). This validation…
Descriptors: Grade 8, Peer Acceptance, African American Children, Validity
Saricoban, Arif – Hacettepe University Journal of Education, 2011
In this article the researcher has examined the current situation in test (a) construction: designing, structuring, developing, (b) administering, and (c) assessing the foreign language tests to see if we are still at the same point (traditional) and has given some suggestions on this indispensable issue. To collect the necessary data the 4th year…
Descriptors: Second Language Instruction, Language Tests, Second Language Learning, Language Skills
Unlu, Huseyin – Educational Sciences: Theory and Practice, 2011
In this study, the development of a Likert-type attitude scale for the profession of physical education teaching (ASPPET) was aimed. The group of the study was consisted of totally 556 pre-service physical education teachers. In order to determine the structural validity of ASPPET, an exploratory and confirmative factor analyses were performed. A…
Descriptors: Physical Education, Factor Structure, Measures (Individuals), Factor Analysis
Norton, Anderson; McCloskey, Andrea; Hudson, Rick A. – Journal of Mathematics Teacher Education, 2011
In order to evaluate the effectiveness of an experimental elementary mathematics field experience course, we have designed a new assessment instrument. These video-based prediction assessments engage prospective teachers in a video analysis of a child solving mathematical tasks. The prospective teachers build a model of that child's mathematics…
Descriptors: Video Technology, Interrater Reliability, Prediction, Knowledge Base for Teaching
Tierney, Robin D.; Simon, Marielle; Charland, Julie – Educational Forum, 2011
Knowing that grades can have long-term consequences for students, teachers voice concern about being fair in the grading process. However, their interpretations of fairness are varied and sometimes contradictory. This study looked at how teachers in one standards-based educational system determined secondary students' grades, focusing specifically…
Descriptors: Grades (Scholastic), Academic Achievement, Grading, Educational Principles
Perpina, Conxa; Cebolla, Ausias; Botella, Cristina; Lurbe, Empar; Torro, Maria-Isabel – Journal of Clinical Child and Adolescent Psychology, 2011
The aims of this study were to validate the Emotional Eating Scale version for children (EES-C) in a Spanish population and study the differences in emotional eating among children with binge eating (BE), overeating (OE), and no episodes of disordered eating (NED). The questionnaire was completed by 199 children aged 9 to 16 years. Confirmatory…
Descriptors: Check Lists, Helplessness, Eating Disorders, Factor Structure
McDonell, James R.; Waters, Tracy J. – Social Indicators Research, 2011
This paper reports the development and validation of the Neighborhood Observation Scale, a 41 item measure of neighborhood physical appearance, social appearance, safety, and amenities. Three independent ratings were collected on each of 244 neighborhoods in 132 census block groups in five South Carolina counties, for a total of 732 observations.…
Descriptors: Neighborhoods, Child Abuse, Safety, Validity
Rhodes, Terrel L. – Change: The Magazine of Higher Learning, 2011
People are inundated with technology on campuses and in their lives. Students are increasingly technology savvy, expecting faculty and administrators to function comfortably within the digital world. They have responded by using technology more and more in teaching and learning. In this article, the author focuses on one such use--student…
Descriptors: Portfolios (Background Materials), Undergraduate Study, Campuses, Evaluation
Stewart, Jeffrey; White, David A. – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2011
Multiple-choice tests such as the Vocabulary Levels Test (VLT) are often viewed as a preferable estimator of vocabulary knowledge when compared to yes/no checklists, because self-reporting tests introduce the possibility of students overreporting or underreporting scores. However, multiple-choice tests have their own unique disadvantages. It has…
Descriptors: Guessing (Tests), Scoring Formulas, Multiple Choice Tests, Test Reliability

Peer reviewed
Direct link
