Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016
Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…
Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability
Engelmann, Jeanine E. – Athletic Training Education Journal, 2016
Context: Peer assessment is widely used in medical education as a formative evaluation and preparatory tool for students. Athletic training students learn similar knowledge, skills, and affective traits as medical students. Peer assessment has been widely studied with beneficial results in medical education, yet athletic training education has…
Descriptors: Peer Evaluation, Undergraduate Students, College Athletics, Professional Education
Ke, Xiaohua; Zeng, Yongqiang; Luo, Haijiao – Journal of Educational Measurement, 2016
This article presents a novel method, the Complex Dynamics Essay Scorer (CDES), for automated essay scoring using complex network features. Texts produced by college students in China were represented as scale-free networks (e.g., a word adjacency model) from which typical network features, such as the in-/out-degrees, clustering coefficient (CC),…
Descriptors: Scoring, Automation, Essays, Networks
Trierweiler, Tammy J.; Lewis, Charles; Smith, Robert L. – Journal of Educational Measurement, 2016
In this study, we describe what factors influence the observed score correlation between an (external) anchor test and a total test. We show that the anchor to full-test observed score correlation is based on two components: the true score correlation between the anchor and total test, and the reliability of the anchor test. Findings using an…
Descriptors: Scores, Correlation, Tests, Test Reliability
Oxendine, Derek – ProQuest LLC, 2016
The Multigroup Ethnic Identity Measure-Revised (MEIM-R; Phinney & Ong, 2007) has been used and validated with a number of ethnic groups. Unfortunately, no studies have examined the psychometric properties of the MEIM-R on an American Indian or Lumbee sample, and American Indians were not included in the sample during scale development. The…
Descriptors: Ethnicity, American Indians, Psychometrics, Tribes
Floyd, Natosha N. – ProQuest LLC, 2016
The purpose of this study was to examine the psychometric properties of the Michigan School Libraries for the 21st Century Measurement Benchmarks (SL21). The instrument consists of 19 items with three subscales: Building the 21st Century Learning Environment Subscale, Teaching for 21st Century Learning Subscale, and Leading the Way to 21st Century…
Descriptors: School Libraries, Benchmarking, Psychometrics, Reliability
Stefanic, Nicholas; Randles, Clint – Music Education Research, 2015
The purpose of this study was to explore the reliability of measures of both individual and group creative work using the consensual assessment technique (CAT). CAT was used to measure individual and group creativity among a population of pre-service music teachers enrolled in a secondary general music class (n = 23) and was evaluated from…
Descriptors: Music Education, Creativity, Preservice Teachers, Music Teachers
Wendel, Erica; Cawthon, Stephanie W.; Ge, Jin Jin; Beretvas, S. Natasha – Journal of Deaf Studies and Deaf Education, 2015
The authors assessed the quality of single-case design (SCD) studies that assess the impact of interventions on outcomes for individuals who are deaf or hard-of-hearing (DHH). More specifically, the What Works Clearinghouse (WWC) standards for SCD research were used to assess design quality and the strength of evidence of peer-reviewed studies…
Descriptors: Deafness, Partial Hearing, Intervention, Research Design
Wendel, Erica; Cawthon, Stephanie W.; Ge, Jin Jin; Beretvas, S. Natasha – Grantee Submission, 2015
The authors assessed the quality of single-case design (SCD) studies that assess the impact of interventions on outcomes for individuals who are deaf or hard-of-hearing (DHH). More specifically, the What Works Clearinghouse (WWC) standards for SCD research were used to assess design quality and strength of evidence of peer-reviewed studies…
Descriptors: Deafness, Partial Hearing, Intervention, Research Design
Vo, Tina; Hammack, Rebekah – Journal of Science Teacher Education, 2022
National reform documents and shifts in educational standards have continued to highlight the importance of engineering and engineering practices within science literacy. High-quality engineering opportunities must be present in formal education due to their association with problem-solving and critical thinking. Given this directive to reach…
Descriptors: Academic Standards, Engineering, Scientific Literacy, Educational Quality
Helman, Amanda; Dennis, Minyi Shih; Kern, Lee – Learning Disability Quarterly, 2022
English learners (ELs) with reading disabilities (RDs) have been among the lowest performers on academic achievement tests that assess vocabulary. To meet academic demands and prepare for college or careers, ELs with RDs clearly need support in terms of vocabulary acquisition; however, relevant research is scarce. This study investigated the…
Descriptors: Vocabulary Development, English Language Learners, Reading Difficulties, Biology
Pelánek, Radek; Effenberger, Tomáš; Kukucka, Adam – Journal of Educational Data Mining, 2022
We study the automatic identification of educational items worthy of content authors' attention. Based on the results of such analysis, content authors can revise and improve the content of learning environments. We provide an overview of item properties relevant to this task, including difficulty and complexity measures, item discrimination, and…
Descriptors: Item Analysis, Identification, Difficulty Level, Case Studies
Kotoka, Love; Kriek, Jeanne – Journal of Baltic Science Education, 2022
Learners underperform in stoichiometry as they lack conceptual reasoning of the underlying concepts and the ability to solve stoichiometric problems. Therefore, it was necessary to determine if there is a statistical correlation between problem-solving skills and conceptual reasoning in stoichiometry and if so, whether one can significantly…
Descriptors: Prediction, Correlation, Science Instruction, Chemistry

Peer reviewed
Direct link
