Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Gustilo, Leah E. – Online Submission, 2016
The present study aimed at characterizing what skilled or more proficient ESL college writing is in the Philippine setting through a contrastive analysis of three groups of variables identified from previous studies: resources, processes, and performance of ESL writers. Based on Chenoweth and Hayes' (2001; 2003) framework, the resource level…
Descriptors: Language Proficiency, English (Second Language), Second Language Learning, Foreign Countries
Samir, Aynaz; Tabatabaee-Yazdi, Mona – International Journal of Language Testing, 2020
The present study aimed to examine and validate a rubric for translation quality assessment using Rasch analysis. To this end, the researchers interviewed 20 expert translation instructors to identify the factors they consider important for assessing the quality of students' translation. Based on the specific commonalities found throughout the…
Descriptors: Translation, Scoring Rubrics, Second Language Learning, Second Language Instruction
Purnomo, Yoppy Wahyu; Pramudiani, Puri; Aziz, Tian Abdul; Kaur, Amrita; Ismail, Siti Noor; Nuriadin, Ishaq – Australian Journal of Teacher Education, 2020
Teachers' beliefs towards educational research have become one significant factor in explaining the gap between research and practice. The present study aimed at reviewing the scale to measure teachers' beliefs about the causes and problems related to research-based practices, describing beliefs that teachers appear to hold, and examining its…
Descriptors: Beliefs, Negative Attitudes, Attitude Measures, Test Validity
Polat, Murat – Novitas-ROYAL (Research on Youth and Language), 2020
Classroom practices, materials and teaching methods in language classes have changed a lot in the last decades and continue to evolve; however, the commonly used techniques to test students' foreign language skills have not changed much regardless of the recent awareness in Bloom's taxonomy. Testing units at schools rely mostly on multiple choice…
Descriptors: Multiple Choice Tests, Test Format, Test Items, Difficulty Level
Tarekegn, Getachew; Terfa, Deresse; Tadesse, Mesfin; Atnafu, Mulugeta; Alemu, Mekbib – Journal of Science Teacher Education, 2020
This study explores Ethiopian preservice primary science teachers' perception of mentoring practices. Using a survey design, the Amharic translation of Mentoring for Effective Primary Science Teaching (MEPST) was administered to 239 graduating preservice science teachers, enrolled in four teacher education colleges. In addition, we interviewed 20…
Descriptors: Preservice Teachers, Translation, Semitic Languages, Foreign Countries
Min, Shangchao; He, Lianzhen; Zhang, Jie – Language Teaching, 2020
This article reviews a selected sample of 70 empirical studies in journal articles and doctoral dissertations on language assessment in China between 2011 and 2018. Following a brief introduction to the history and current state of language assessment in China, the article presents a critical review of language assessment research on six themes…
Descriptors: Language Tests, Test Reliability, Test Validity, Journal Articles
Zulaiha, Siti; Mulyono, Herri – Cogent Education, 2020
The training of teachers is one of the most critical factors in improving the quality of teaching and assessment in the classroom. EFL teachers need to be literate in language assessment; this can be achieved through training. A total of 147 Junior High School EFL teachers were surveyed to identify their training needs in assessment. Semi-structured…
Descriptors: Junior High School Teachers, Teacher Attitudes, Language Teachers, English (Second Language)
Shah, Harshini; Niland, Katherine; Kharsa, Miranda; Caronongan, Pia; Moiduddin, Emily – US Department of Health and Human Services, 2020
In 2017, the Office of Planning, Research, and Evaluation (OPRE) in the Administration for Children and Families (ACF) funded Mathematica to conduct the Infant and Toddler Teacher and Caregiver Competencies (ITTCC) project. The project aims to examine existing efforts across states, institutions of higher education, professional organizations, and…
Descriptors: Infants, Toddlers, Caregivers, Preschool Teachers
Osler, James Edward, II – Journal of Educational Technology, 2015
This monograph provides an epistemological rationale for the Accumulative Manifold Validation Analysis [also referred to by the acronym "AMOVA"] statistical methodology designed to test psychometric instruments. This form of inquiry is a form of mathematical optimization in the discipline of linear stochastic modelling. AMOVA is an in-depth…
Descriptors: Statistical Analysis, Test Validity, Test Reliability, Inquiry
Harrison, George M. – Journal of Educational Measurement, 2015
The credibility of standard-setting cut scores depends in part on two sources of consistency evidence: intrajudge and interjudge consistency. Although intrajudge consistency feedback has often been provided to Angoff judges in practice, more evidence is needed to determine whether it achieves its intended effect. In this randomized experiment with…
Descriptors: Interrater Reliability, Standard Setting (Scoring), Cutting Scores, Feedback (Response)
Dressler, William W.; Balieiro, Mauro C.; dos Santos, José Ernesto – Field Methods, 2015
This article reports the replication after 10 years of cultural consensus analyses in four cultural domains in the city of Ribeirão Preto, Brazil. Additionally, two methods for evaluating residual agreement are applied to the data, and a new technique for evaluating how cultural knowledge is represented by residual agreement is introduced. We…
Descriptors: Foreign Countries, Culture, Change, Reliability
Zaporozhets, Olga; Fox, Christine M.; Beltyukova, Svetlana A.; Laux, John M.; Piazza, Nick J.; Salyers, Kathleen – Measurement and Evaluation in Counseling and Development, 2015
This study aimed to develop a linear measure of change using University of Rhode Island Change Assessment items that represented Prochaska and DiClemente's theory. The resulting Toledo Measure of Change is short, is easy to use, and provides reliable scores for identification of individuals' stage of change and progression within that stage.
Descriptors: Item Response Theory, Change, Measures (Individuals), Test Construction
McGill, Ryan J. – Journal of Psychoeducational Assessment, 2015
The Cognitive Assessment System-Second Edition (CAS2) is an individually administered measure of cognitive ability designed for children and adolescents ages 5 through 18 years. The measure, authored by Jack A. Naglieri, J. P. Das, and Sam Goldstein, was published by Pro-Ed in 2014 and is the first revision of the Cognitive Assessment System (CAS;…
Descriptors: Cognitive Tests, Children, Adolescents, Cognitive Processes
Villarreal, Victor – Journal of Psychoeducational Assessment, 2015
The Woodcock-Johnson IV Tests of Achievement (WJ IV ACH; Schrank, Mather, & McGrew, 2014a) is an individually administered measure containing tests of reading, mathematics, written language, and academic knowledge. Areas of reading, mathematics, and written language each include tests of basic skills, fluency, and application. Academic…
Descriptors: Achievement Tests, Scoring, Test Construction, Item Analysis
Unicomb, Rachael; Colyvas, Kim; Harrison, Elisabeth; Hewat, Sally – Journal of Speech, Language, and Hearing Research, 2015
Purpose: Case-study methodology studying change is often used in the field of speech-language pathology, but it can be criticized for not being statistically robust. Yet with the heterogeneous nature of many communication disorders, case studies allow clinicians and researchers to closely observe and report on change. Such information is valuable…
Descriptors: Case Studies, Research Methodology, Statistical Analysis, Change

