Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Yun, Jiyeo – ProQuest LLC, 2017
Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…
Descriptors: Interrater Reliability, Essays, Scoring, Evaluators
McGough, David J. – AERA Online Paper Repository, 2017
This paper describes the implementation of an inter-rater reliability measure for assessing portfolio scores in a teacher education program. The reliability coefficient for the portfolio scores from completers of a newly revised program were compared with the reliability coefficient of the scores from a second set of reviewers who discussed the…
Descriptors: Interrater Reliability, Teacher Education Programs, Program Evaluation, Portfolio Assessment
Puskulluoglu, Elif Iliman; Altinkurt, Yahya – Online Submission, 2017
The purpose of this study is to develop a data collection tool in order to define the levels of teachers' structural empowerment. The sample of the research consists of teachers of primary, secondary and high schools. For the construct validity, explanatory and confirmatory factor analyses are done. The five-factor structure, emerged as the result…
Descriptors: Teacher Empowerment, Elementary School Teachers, Secondary School Teachers, Measures (Individuals)
Kate S. Wolfe; Sarah L. Hoiland – Numeracy, 2017
In this paper, our goals were to assess the suitability of the Subjective Numeracy Scale (SNS), developed for health-care use, in a new context with predominantly minority students at a South Bronx community college and to identify any race/ ethnicity, gender, and ESL enrollment effects. The scale assesses perceptions of quantitative reasoning…
Descriptors: Numeracy, Community College Students, Minority Group Students, Measures (Individuals)
Nijakowska, Joanna; Tsagari, Dina; Spanoudis, George – Studies in Second Language Learning and Teaching, 2020
The aim of this study was to validate a 24-item TEPID (Teachers of EFL Preparedness to Include Dyslexics) scale measuring the beliefs of 546 pre-service and in-service teachers of English as a foreign language (EFL) across three countries (Cyprus, Greece, and Poland) on their preparedness to include learners with dyslexia in mainstream foreign…
Descriptors: English (Second Language), Second Language Instruction, Language Teachers, Dyslexia
Lin, Shuqiong; Luo, Wen; Tong, Fuhui; Irby, Beverly J.; Alecio, Rafael Lara; Rodriguez, Linda; Chapa, Selena – Cogent Education, 2020
Student learning objectives (SLOs) have become an increasingly popular tool for teacher evaluations as an alternative to Value-added Models (VAMs). However, the use of SLOs faces two major challenges. First, the target setting is mostly subjective and arbitrary. Second, there is little evidence on the reliability and validity of the tool. In this…
Descriptors: Student Educational Objectives, Teacher Evaluation, Data Use, Academic Achievement
Montroy, Janelle J.; Zucker, Tricia A.; Assel, Michael M.; Landry, Susan H.; Anthony, Jason L.; Williams, Jeffrey M.; Hsu, Hsien-Yuan; Crawford, April; Johnson, Ursula Y.; Carlo, Maria S.; Taylor, Heather B. – Early Education and Development, 2020
There is a significant need for kindergarten entry assessments (KEA) that meet state education agency (SEA) requirements and are psychometrically sound measures of a broad range of school readiness domains such as language, literacy, math, science, executive function, and social-emotional skills. Research Findings: In this paper, we describe five…
Descriptors: Kindergarten, School Readiness, Student Evaluation, Test Construction
Rickert, Nicolette P.; Skinner, Ellen A.; Roeser, Robert W. – International Journal of Behavioral Development, 2020
In response to growing interest in mindfulness as a support for educators, the current study sought to create and test a new multidimensional and multi-informant measure of teacher mindfulness in the classroom. To counter some of the limitations of context-general self-reports, we designed two theoretically based classroom-specific measures that…
Descriptors: Metacognition, Middle School Students, Middle School Teachers, Teacher Behavior
Cai, Yuyang; Kunnan, Antony John – Language Testing, 2020
An essential hypothesis of modern language assessment theory pertains to the interaction between strategy use ability (strategic competence) and second language knowledge. However, how they interact with each other is rarely explored. Drawing on relevant research in the literature, in this paper we proposed three interaction patterns (i.e.,…
Descriptors: English (Second Language), Second Language Learning, Nursing Education, Reading Tests
Chan, Stephanie W. Y.; Cheung, Wai Ming; Huang, Yanli; Lam, Wai-Ip; Lin, Chin-Hsi – Language Testing, 2020
Demand for second-language (L2) Chinese education for kindergarteners has grown rapidly, but little is known about these kindergarteners' L2 skills, with existing studies focusing on school-age populations and alphabetic languages. Accordingly, we developed a six-subtest Chinese character acquisition assessment to measure L2 kindergarteners'…
Descriptors: Chinese, Second Language Learning, Second Language Instruction, Written Language
Gage, Nicholas A.; Prykanowski, Debra; Hirn, Regina – Behavioral Disorders, 2014
Reliability of direct observation outcomes ensures the results are consistent, dependable, and trustworthy. Typically, reliability of direct observation measurement approaches is assessed using interobserver agreement (IOA) and the calculation of observer agreement (e.g., percentage of agreement). However, IOA does not address intraobserver…
Descriptors: Observation, Measurement Techniques, Reliability, Emotional Disturbances
Aljunied, Mariam; Frederickson, Norah – Educational Psychology in Practice, 2014
Despite embracing a bio-psycho-social perspective, the World Health Organization's International Classification of Functioning, Disability and Health (ICF) assessment framework has had limited application to date with children who have special educational needs (SEN). This study examines its utility for educational psychologists' work with…
Descriptors: Educational Psychology, Classification, Clinical Diagnosis, Special Needs Students
Chu, Szu-Yin – International Journal of Early Years Education, 2015
Positive Behaviour Intervention and Support (PBIS) is an evidence-based approach that has been proven to be effective in remediating problem behaviours in children. The purpose of this study was to evaluate the effectiveness of the family-centred PBIS approach when involving Taiwanese families in the treatment of off-task and non-compliant…
Descriptors: Foreign Countries, Positive Reinforcement, Teaching Methods, Behavior Modification
Finelli, Cynthia J.; Borrego, Maura; Rasoulifar, Golnoosh – Journal of Engineering Education, 2015
The diversity of engineering education research provides an opportunity for cross-fertilization of ideas and creativity, but it also can result in fragmentation of the field and duplication of effort. One solution is to establish a standardized taxonomy of engineering education terms to map the field and communicate and connect research…
Descriptors: Engineering Education, Taxonomy, Vocabulary, Educational Research
Hunter, Simon C.; Houghton, Stephen; Wood, Lisa – Australian Educational and Developmental Psychologist, 2015
While there is increasing recognition of the need to go beyond measures of mental ill health, there is a relative dearth of validated tools for assessing mental well-being among adolescents. The Warwick-Edinburgh Mental Well-being Scale (WEMWBS) is a promising tool for use in this context, and this study evaluated its use in an Australian context.…
Descriptors: Foreign Countries, Well Being, Adolescents, High School Students

Direct link
Peer reviewed
