Publication Date
| In 2026 | 0 |
| Since 2025 | 60 |
| Since 2022 (last 5 years) | 286 |
| Since 2017 (last 10 years) | 782 |
| Since 2007 (last 20 years) | 2044 |
Descriptor
| Interrater Reliability | 3126 |
| Foreign Countries | 655 |
| Test Reliability | 504 |
| Evaluation Methods | 503 |
| Test Validity | 411 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Peer reviewedFrankel, Karen A.; Boyum, Lisa A.; Harmon, Robert J. – Journal of the American Academy of Child and Adolescent Psychiatry, 2004
Objective: To present data from a general infant psychiatry clinic, including range and frequency of presenting symptoms, relationship between symptoms and diagnoses, and comparison of two diagnostic systems, DSM-IV and Diagnostic Classification of Mental Health and Developmental Disorders of Infancy and Early Childhood (DC: 0-3). Method: A…
Descriptors: Psychiatry, Infants, Interrater Reliability, Clinics
Kang, Hyun-Ah; Poertner, John – Child Abuse & Neglect: The International Journal, 2006
Objective: The purpose of this study was to determine the level of inter-rater reliability of the Illinois Structured Decision Support Protocol by examining the level of Child Protective Services (CPS) caseworkers' agreement regarding state interventions. The Protocol was designed to guide CPS workers to consistent decisions related to the level…
Descriptors: Interrater Reliability, Decision Support Systems, Intervention, Caseworkers
Reber, Rolf – American Psychologist, 2006
This paper comments on the article "Psychology and Phenomenology: A Clarification" by H. H. Kendler. Kendler contrasted objective phenomena going on in the mind with phenomenological convictions. He concluded, on the basis of a thoughtful analysis, that scientific psychology cannot validate moral principles, which have to be agreed upon by…
Descriptors: Psychology, Psychological Studies, Phenomenology, Moral Values
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests
Clark, Douglas B.; Sampson, Victor – Journal of Research in Science Teaching, 2008
The national science standards, along with prominent researchers, call for increased focus on scientific argumentation in the classroom. Over the past decade, researchers have developed sophisticated online science learning environments to support these opportunities for scientific argumentation. Assessing the quality of dialogic argumentation,…
Descriptors: Persuasive Discourse, Interrater Reliability, Concept Formation, Discourse Modes
Wang, Hao-Chuan; Chang, Chun-Yen; Li, Tsai-Yen – Computers & Education, 2008
The work aims to improve the assessment of creative problem-solving in science education by employing language technologies and computational-statistical machine learning methods to grade students' natural language responses automatically. To evaluate constructs like creative problem-solving with validity, open-ended questions that elicit…
Descriptors: Interrater Reliability, Earth Science, Problem Solving, Grading
Shriberg, David; Bonner, Mike; Sarr, Brianna J.; Walker, Ashley Marks; Hyland, Megan; Chester, Christie – School Psychology Review, 2008
Social justice is an aspiration that most, if not all, school psychologists likely support, yet there is a lack of research delineating how this term translates to school psychology practice. This article presents the results of a Delphi study of 44 cultural diversity experts in school psychology regarding (a) defining social justice from a school…
Descriptors: Social Justice, Delphi Technique, School Psychologists, Cultural Pluralism
Burdsal, Charles A.; Harrison, Paul D. – Assessment & Evaluation in Higher Education, 2008
The purpose of this research is to provide additional empirical evidence supporting the use of both a multidimensional profile and an overall evaluation of teaching effectiveness as valid indicators of student perceptions of effective classroom instruction. A factor analytic teaching evaluation instrument was used that also included open-ended…
Descriptors: Student Evaluation of Teacher Performance, Factor Analysis, Profiles, Multidimensional Scaling
Prathanee, Benjamas; Pongjanyakul, Amornrat; Chano, Jiraporn – International Journal of Language & Communication Disorders, 2008
Background: Children with delayed speech and language development are at considerable risk for later language impairment, social and behavioural problems, and illiteracy. Early diagnosis is needed for intervention planning and prevention. However, a speech and language test for Thai children has not been available. Aims: To establish a Thai Speech…
Descriptors: Delayed Speech, Language Impairments, Language Tests, Interrater Reliability
Liow, Jong-Leng – European Journal of Engineering Education, 2008
Peer assessment has been studied in various situations and actively pursued as a means by which students are given more control over their learning and assessment achievement. This study investigated the reliability of staff and student assessments in two oral presentations with limited feedback for a school-based thesis course in engineering…
Descriptors: Feedback (Response), Student Evaluation, Grade Point Average, Peer Evaluation
Crawford, Lindy; Lloyd, Susan; Knoth, Kelly – Assessment for Effective Intervention, 2008
Type and quality of revisions made by students between first and final drafts of a state writing test were scored using a revision taxonomy. Scorers categorized revisions first by unit (e.g., word, phrase, sentence), and then by type (e.g., addition, substitution, spelling). They then evaluated the impact of each revision on the readability of the…
Descriptors: Writing Tests, Revision (Written Composition), State Standards, Writing Evaluation
Hunn, Lorie L. – ProQuest LLC, 2009
This study explored and compared the ways in which school-based cooperating teachers and college supervisors evaluate student teachers. The scores allocated to student teachers by school-based cooperating teachers and college supervisors in the final field experience evaluations of student teachers were analyzed. A mixed methods research design…
Descriptors: Cooperating Teachers, Leadership, Research Design, Student Teachers
Chiat, Shula; Roy, Penny – Journal of Speech, Language, and Hearing Research, 2007
Purpose: To determine the psychometric properties of the Preschool Repetition (PSRep) Test (Roy & Chiat, 2004), to establish the range of performance in typically developing children and variables affecting this performance, and to compare the performance of clinically referred children. Method: The PSRep Test comprises 18 words and 18…
Descriptors: Phonology, Psychometrics, Interrater Reliability, Followup Studies
Anseel, Frederik; Lievens, Filip – Journal of Career Development, 2007
This study examines how feedback interest after career assessment can be influenced by changing individuals' beliefs about the importance and modifiability of the various performance dimensions. In an experiment, 82 master students completed a computerized assessment tool developed for assessing managerial potential. Results showed that…
Descriptors: Feedback (Response), Interrater Reliability, Counselors, Career Counseling
McCandless, Stephen; O'Laughlin, Liz – Journal of Attention Disorders, 2007
Objective: Current theories hypothesize that deficits in executive functioning (EF) are responsible for the symptoms of ADHD and that specific patterns of EF deficits may be associated with different subtypes of ADHD. The present study evaluates the validity and clinical usefulness of the Behavior Rating Inventory of Executive Function, a behavior…
Descriptors: Test Validity, Interrater Reliability, Attention Deficit Disorders, Rating Scales

Direct link
