Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Gauns Dessai, Kissan G.; Kamat, Venkatesh V. – International Journal of Information and Communication Technology Education, 2018
Educational institutions worldwide conduct summative examinations to evaluate academic performance of students. Such summative examinations are normally subjective in nature in higher education institutions and needs manual evaluation. However, the manual evaluation of subjective answer-scripts often suffers from evaluation anomalies and the…
Descriptors: Computer Assisted Testing, Student Evaluation, Scoring Rubrics, Error Patterns
Tasgin, Adnan; Korucuk, Murat – Journal of Curriculum and Teaching, 2018
In this research it is aimed to develop an instrument that could be used to measure university students' satisfaction with foreign language lessons in a valid and reliable manner. The research was conducted on three separate study groups consisting of 460 students in the spring semester of the 2017-2018 academic year. In the research, firstly, an…
Descriptors: Foreign Countries, Test Validity, Test Reliability, Second Language Learning
Pivovarova, Margarita; Amrein-Beardsley, Audrey – Educational Assessment, 2018
While states are no longer required to set up teacher evaluation systems based in significant part on student test scores, quite a few continue to use value-added (VAMs) or student growth percentile (SGP) models for that purpose. In this study, we analyzed three years of teacher data to illustrate the performance of teachers' median growth…
Descriptors: Growth Models, Teacher Evaluation, Value Added Models, Reliability
Adedokun, Omolola A. – Journal of Extension, 2018
This article provides an illustrative description of the pre-post difference index (PPDI), a simple, nontechnical yet robust tool for examining the instructional sensitivity of assessment items. Extension educators often design pretest-posttest instruments to assess the impact of their curricula on participants' knowledge and understanding of the…
Descriptors: Extension Education, Extension Agents, Pretests Posttests, Curriculum Evaluation
Williams-Washington, Kristin N.; Mills, Chmaika P. – Journal of Multicultural Counseling and Development, 2018
Research indicates that race-based discrimination is detrimental to the mental and physical health of African Americans. The authors sought preliminary evidence of internal consistency and factorial validity of an African American Historical Trauma questionnaire administered to 400 participants. Reliability and exploratory factor analyses resulted…
Descriptors: Racial Discrimination, Trauma, African American History, Factor Analysis
Ingram, Jenni; Sammons, Pam; Lindorff, Ariel – Education Development Trust, 2018
This review examines a range of lesson observation frameworks designed for and used in the observation of teaching in mathematics. This includes frameworks specifically designed for international comparisons of teaching practices and teacher effectiveness, as well as those used for teaching development. Five chapters focus on: (1) Introduction…
Descriptors: Mathematics Instruction, Observation, Comparative Education, Teaching Methods
Australian Government Tertiary Education Quality and Standards Agency, 2018
Admissions transparency means that prospective domestic undergraduate students can easily find good quality admissions information that allows them to compare courses and providers and make informed study choices. In October 2016 the Higher Education Standards Panel (HESP) made recommendations to achieve greater transparency in higher education…
Descriptors: College Admission, Accountability, Access to Education, Access to Information
Popham, W. James – ASCD, 2018
What is assessment literacy? It is a handful of fundamental understandings about the testing concepts and procedures that influence educational decisions. And it just might be the most cost-effective means of real school improvement. With characteristic humor and aplomb, assessment expert W. James Popham strips away the psychometrician-speak and…
Descriptors: Student Evaluation, Educational Testing, Test Validity, Test Reliability
Nadasdy, Paul; Aizawa, Kazumi; Iso, Tatsuo – Research-publishing.net, 2018
The New General Service List Test (NGSLT) (Stoeckel & Bennett, 2015) was designed as a diagnostic test to measure students' written receptive vocabulary knowledge. This test battery was developed based upon the New General Service List (NGSL) (Browne, 2013), which makes it appealing to teachers in Japan, and especially those who see vocabulary…
Descriptors: Test Reliability, Receptive Language, Vocabulary, Language Tests
Karimi, Hamid; O'Brian, Sue; Onslow, Mark; Jones, Mark – Journal of Speech, Language, and Hearing Research, 2014
Purpose: Percentage of syllables stuttered (%SS) and severity rating (SR) scales are measures in common use to quantify stuttering severity and its changes during basic and clinical research conditions. However, their reliability has not been assessed with indices measuring both relative and absolute reliability. This study was designed to provide…
Descriptors: Reliability, Syllables, Stuttering, Severity (of Disability)
Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017
In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…
Descriptors: Cutting Scores, Weighted Scores, Classification, Testing
Algozzine, Bob; Morsbach Sweeney, Holly; Choi, Jeong Hoon; Horner, Rob; Sailor, Wayne; McCart, Amy B.; Satter, Allyson; Lane, Kathleen Lynne – Journal of Psychoeducational Assessment, 2017
U.S. public education systems are required to provide free appropriate public education to students with disabilities in least restrictive environments that are appropriate to meet their individual needs. The practice of educating students with disabilities in neighborhood schools in age-appropriate general education classrooms and other school…
Descriptors: Fidelity, Program Implementation, Disabilities, Inclusion
Arnoux-Nicolas, Caroline; Sovet, Laurent; Lhotellier, Lin; Bernaud, Jean-Luc – International Journal for Educational and Vocational Guidance, 2017
The purpose of this study was to validate a psychometric instrument among French workers for assessing the meaning of work. Following an empirical framework, a two-step procedure consisted of exploring and then validating the scale among distinctive samples. The consequent Meaning of Work Inventory is a 15-item scale based on a four-factor model,…
Descriptors: Foreign Countries, Employee Attitudes, Work Attitudes, Measures (Individuals)
Bercovitz, Katherine; Pagnini, Francesco; Phillips, Deborah; Langer, Ellen – Creativity Research Journal, 2017
Despite the growing interest in mindfulness and its demonstrated benefits, there are concerns about the reliance on subjective assessment tools. This study focused on the measurement of Langerian mindfulness, which refers to the active process of noticing new things and flexibly responding to the current context. Some of its key features overlap…
Descriptors: Metacognition, Creativity, Task Analysis, Measurement
Miles, Anna – International Journal of Language & Communication Disorders, 2017
Background: Oesophageal abnormalities are common findings in a speech-language therapy videofluoroscopy clinic. Fluoroscopic screening involving oropharynx alone fails to identify these patients. Oesophageal screening as an adjunct to videofluoroscopy is gaining popularity. Yet currently, little is known about the reliability of speech and…
Descriptors: Interrater Reliability, Speech Therapy, Allied Health Personnel, Speech Language Pathology

Peer reviewed
Direct link
