Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Bush, Martin – Assessment & Evaluation in Higher Education, 2015
The humble multiple-choice test is very widely used within education at all levels, but its susceptibility to guesswork makes it a suboptimal assessment tool. The reliability of a multiple-choice test is partly governed by the number of items it contains; however, longer tests are more time consuming to take, and for some subject areas, it can be…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Format, Test Reliability
Pugh, Debra; Hamstra, Stanley J.; Wood, Timothy J.; Humphrey-Murto, Susan; Touchie, Claire; Yudkowsky, Rachel; Bordage, Georges – Advances in Health Sciences Education, 2015
Internists are required to perform a number of procedures that require mastery of technical and non-technical skills, however, formal assessment of these skills is often lacking. The purpose of this study was to develop, implement, and gather validity evidence for a procedural skills objective structured clinical examination (PS-OSCE) for internal…
Descriptors: Graduate Students, Medical Students, Internal Medicine, Skills
Mascitelli, Andréa N.; Rojahn, Johannes; Nicolaides, Vias C.; Moore, Linda; Hastings, Richard P.; Christian-Jones, Ceri – Journal of Applied Research in Intellectual Disabilities, 2015
Background: The Behaviour Problems Inventory-Short Form (BPI-S) is a spin-off of the BPI-01 that was empirically developed from a large BPI-01 data set. In this study, the reliability and factorial validity of the BPI-S was investigated for the first time on newly collected data from adults with intellectual disabilities. Methods: The sample…
Descriptors: Behavior Problems, Test Validity, Test Reliability, Adults
Yang, Fuyi; Xu, Jianzhong – Journal of Psychoeducational Assessment, 2015
This study reports on the psychometric evaluation of the Chinese version of the Homework Management Scale (HMS). The HMS was designed to assess students' homework management strategies. Based on a randomized split of 884 high school students in China, we conducted exploratory factor analysis on Group 1 (n = 442) and confirmatory factor analysis on…
Descriptors: Foreign Countries, High School Students, Psychometrics, Measures (Individuals)
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making
Doraiswamy, Nithya; Porter, Kristen M.; Wilson, Grant; Paprzycki, Peter; Czerniak, Charlene M.; Tuttle, Nicole; Czajkowski, Kevin – Journal of School Leadership, 2016
This paper describes the development and validation of a science teacher leadership instrument modeled on the seven domains of the Teacher Leader Model (TLM) Standards (The Teacher Leadership Exploratory Consortium, 2011). Instrument development was part of National Science Foundation--funded Mathematics and Science Partnership (MSP) program that…
Descriptors: Test Construction, Test Validity, Teacher Leadership, Teacher Behavior
Menold, Natalja; Tausch, Anja – Sociological Methods & Research, 2016
Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…
Descriptors: Rating Scales, Test Reliability, Measurement, Classification
Thaneerananon, Taveep; Triampo, Wannapong; Nokkaew, Artorn – International Journal of Instruction, 2016
Nowadays, one of the biggest challenges of education in Thailand is the development and promotion of the students' thinking skills. The main purposes of this research were to develop an analytical thinking test for 6th grade students and evaluate the students' analytical thinking. The sample was composed of 3,567 6th grade students in 2014…
Descriptors: Test Construction, Thinking Skills, Opinions, Cognitive Tests
Climie, Emma; Henley, Laura – British Journal of Special Education, 2016
School-based practitioners are often called upon to provide assessment and recommendations for struggling students. These assessments often open doors to specialised services or interventions and provide opportunities for students to build competencies in areas of need. However, these assessments often fail to highlight the abilities of these…
Descriptors: Student Evaluation, Alternative Assessment, Relevance (Education), Models
Basha, Ertan; Kaya, Mehmet – Universal Journal of Educational Research, 2016
The purpose of this study is to examine validity and reliability of the Albanian version of the Depression, Anxiety and Stress Scale (DASS), which is developed by Lovibond and Lovibond (1995). The sample of this study is consisted of 555 subjects who were living in Kosovo. The results of confirmatory factor analysis indicated 42 items loaded on…
Descriptors: Foreign Countries, Depression (Psychology), Anxiety, Stress Variables
Jin, Ying; Eason, Hershel – Journal of Educational Issues, 2016
The effects of mean ability difference (MAD) and short tests on the performance of various DIF methods have been studied extensively in previous simulation studies. Their effects, however, have not been studied under multilevel data structure. MAD was frequently observed in large-scale cross-country comparison studies where the primary sampling…
Descriptors: Test Bias, Simulation, Hierarchical Linear Modeling, Comparative Analysis
Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett – Journal of Multicultural Counseling and Development, 2016
The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…
Descriptors: Counseling Techniques, Cultural Relevance, Counselor Qualifications, Expertise
Alkhamra, Rana A.; Al-Jazi, Aya B. – International Journal of Language & Communication Disorders, 2016
Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
Descriptors: Receptive Language, Tests, Children, Test Validity
Kumazawa, Takaaki; Shizuka, Tetsuhito; Mochizuki, Masamichi; Mizumoto, Atsushi – Language Testing in Asia, 2016
Placement testing is a crucial issue in Japanese universities. In the majority of language programs, classes are streamed by proficiency levels based on students' placement test score for efficient instruction because university students' proficiency levels vary greatly even in the same program. The Visualizing English Language Competency Test®…
Descriptors: Foreign Countries, Higher Education, English (Second Language), Language Proficiency
Zhang, Tan; Chen, Ang – AERA Online Paper Repository, 2016
Based on the Job Demands-Resources model, the study developed and validated an instrument that measures physical education teachers' job demands/resources perception. Expert review established content validity with the average item rating of 3.6/5.0. Construct validity and reliability were determined with a teacher sample (n=193). Exploratory…
Descriptors: Physical Education Teachers, Teaching Load, Resources, Measures (Individuals)

Peer reviewed
Direct link
