Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveals that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Cankoy, Osman; Özder, Hasan – EURASIA Journal of Mathematics, Science & Technology Education, 2017
The aim of this study is to develop a scoring rubric to assess primary school students' problem posing skills. A rubric comprising five dimensions (solvability, reasonability, mathematical structure, context, and language) was used. The raters scored the students' problem posing skills both with and without the scoring rubric to test the…
Descriptors: Generalizability Theory, Elementary School Students, Foreign Countries, Problem Solving
Wan, Ming Wai; Brooks, Ami; Green, Jonathan; Abel, Kathryn; Elmadih, Alya – International Journal of Behavioral Development, 2017
This study investigated the psychometrics of a recently developed global rating measure of videotaped parent-infant interaction, the "Manchester Assessment of Caregiver-Infant Interaction" (MACI), in a normative sample. Inter-rater reliability, stability over time, and convergent and discriminant validity were tested. Six-minute play…
Descriptors: Rating Scales, Parent Child Relationship, Infants, Interaction
Özdas, Faysal; Batdi, Veli – Journal of Education and Training Studies, 2017
This thematic-based meta-analytic study aims to examine the effect of creativity on the academic success and learning retention scores of students. In the context of this aim, 18 out of 225 studies regarding creativity that were carried out between 2001 and 2011 have been obtained from certain national and international databases. The studies…
Descriptors: Meta Analysis, Creativity, Scores, Retention (Psychology)
Nurnberger-Haag, Julie; Kratky, Joseph; Karpinski, Aryn C. – International Electronic Journal of Mathematics Education, 2022
Skills and understanding of operations with negative numbers, which are typically taught in middle school, are crucial aspects of numerical competence necessary for all subsequent mathematics. To more swiftly and coherently develop the field's understanding of how to foster this critical competence, we need shared measures that allow us to compare…
Descriptors: Numbers, Number Concepts, Middle School Students, Secondary School Mathematics
Vidal Rodeiro, Carmen; Chambers, Lucy – Research Matters, 2022
Many high-stakes qualifications include non-exam assessments that are marked by teachers. Awarding bodies then apply a moderation process to bring the marking of these assessments to an agreed standard. Comparative Judgement (CJ) is a technique where two (or more) pieces of work are compared at a time, allowing an overall rank order of work to be…
Descriptors: Evaluation Methods, Portfolios (Background Materials), Decision Making, Task Analysis
Park, Diana E.; Bridges, Laurie M. – Communications in Information Literacy, 2022
There is a common classroom refrain, "Don't use 'Wikipedia'; it's unreliable." Unfortunately, this simple dismissal of the world's largest repository of information fails to engage students in a critical conversation about how knowledge within "Wikipedia" is constructed and shared. "Wikipedia" is available…
Descriptors: Encyclopedias, Electronic Publishing, Collaborative Writing, Information Literacy
Arunachalam, Sudha; Avtushka, Valeryia; Luyster, Rhiannon J.; Guthrie, Whitney – Language Learning and Development, 2022
Vocabulary checklists completed by caregivers are a common way of measuring children's vocabulary knowledge. We provide evidence from checklist data from 31 children with and without autism spectrum disorder. When asked to report twice about whether or not their child produces a particular word, caregivers are largely consistent in their…
Descriptors: Verbs, Vocabulary Development, Nouns, Language Acquisition
Akase, Masaki – Language Testing in Asia, 2022
The purpose of this study is to equate and further validate three forms of the vocabulary size test (VST) created by Aizawa and Mochizuki (2010). These three forms, VST 1, 2, and 3, were administered to a cohort of 189 high school students ranging in age from 16 to 18 in April of their 1st, 2nd, and 3rd year of high school. Although these…
Descriptors: Vocabulary Development, Vocabulary Skills, Language Tests, Longitudinal Studies
Al-Shenikat, Feryal Abdel-Hadi – Educational Research and Reviews, 2022
The present study aimed to find out the level of critical thinking skills of a Jordanian sample of blind students and its relationship with some variables, namely gender and class level. To achieve the objectives of the study, the researcher developed the California Critical Thinking Scale in line with the characteristics of the…
Descriptors: Foreign Countries, Blindness, Students with Disabilities, Critical Thinking
Halfon, Ester; Biton, Yaniv – International Journal of Education in Mathematics, Science and Technology, 2022
As part of efforts to improve the quality of mathematics teaching and evaluation, we examined the focus of math teachers' considerations in evaluating students' achievements, as well as the links between these focuses, regarding differences between students and the validity and reliability of assessment methods and examinations. Based on the…
Descriptors: Mathematics Teachers, Mathematics Instruction, Teacher Attitudes, Student Evaluation
Lenz, A. Stephen; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
The factor structure, measurement invariance, and internal consistency of the Patient Health Questionnaire for Depression and Anxiety (PHQ-4) were examined with a rural, predominately Hispanic sample (N = 711). Findings supported use of a one-factor model across gender, age groups, and Spanish-speaking groups. Counseling practice and research…
Descriptors: Psychometrics, Error of Measurement, Patients, Questionnaires
Zhu, Peitao; Liu, Yanhong; Luke, Melissa M.; Wang, Qiu – Measurement and Evaluation in Counseling and Development, 2022
Researchers developed and initially validated a client-report measure of counselors' cultural humility, entitled the Cultural Humility and Enactment Scale (CHES). The sample includes 434 adults recruited from web-based surveys. Exploratory factor analyses were performed to examine the initial factor structure of the CHES. Bivariate correlations…
Descriptors: Cultural Awareness, Measures (Individuals), Construct Validity, Predictive Validity
Manzano, Dexter L. – International Journal of Language Testing, 2022
The increasing popularity of self-assessment prompted several scholars to investigate its effectiveness and accuracy in relation to teacher assessment. However, most of these studies focused only on the consistency estimate perspective. Thus, the current study investigated the interrater reliability between self- and teacher assessment of…
Descriptors: Oral Language, Self Evaluation (Individuals), College Students, Interrater Reliability
Uluocak, Mustafa; Ipek, Ozan – International Journal of Progressive Education, 2022
The purpose of the study is to determine pre-service Turkish language teachers' use of text structure elements and their awareness and experience with argumentative writing. The research was designed as a case study, which included 115 undergraduate students studying Turkish language teaching. The data of the study consisted of the participants'…
Descriptors: Turkish, Language Teachers, Preservice Teachers, Persuasive Discourse

