Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 8 |
Descriptor
| Testing | 12 |
| Validity | 12 |
| Scoring | 11 |
| Reliability | 7 |
| Scores | 4 |
| Psychometrics | 3 |
| Achievement Tests | 2 |
| Comparative Analysis | 2 |
| Construct Validity | 2 |
| Correlation | 2 |
| English (Second Language) | 2 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 7 |
| Reports - Evaluative | 3 |
| Reports - Descriptive | 2 |
| Reports - Research | 2 |
| Books | 1 |
| Dissertations/Theses -… | 1 |
| Guides - Non-Classroom | 1 |
| Opinion Papers | 1 |
Education Level
| Higher Education | 2 |
| Elementary Secondary Education | 1 |
| Preschool Education | 1 |
Audience
| Practitioners | 1 |
| Teachers | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Nelson, Nickola Wolf; Plante, Elena – Language, Speech, and Hearing Services in Schools, 2022
Purpose: This study evaluated the equivalence of the Test of Integrated Language and Literacy Skills (TILLS) when administrated via telepractice (Tele-TILLS) and face-to-face methods. Method: Participants were 51 children and adolescents in three age bands, ages 6-7 years (n = 9), 8-11 years (n = 21), and 12-18 years (n = 21). Data were gathered…
Descriptors: Telecommunications, Standardized Tests, Language Skills, Literacy
Nixi Wang – ProQuest LLC, 2022
Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…
Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity
Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2015
Test-retest studies for assessing stability and change are widely used in different domains and allow improved or additional individual estimates of interest to be obtained. However, if these estimates are to be validly interpreted the responses given at Time-2 must be free of retest effects, and the fulfilment of this assumption must be…
Descriptors: Item Response Theory, Evaluation Methods, Responses, Testing
Rogler, Dawn – English Teaching Forum, 2014
This article presents principles and practices of effective assessment, outlining seven key concepts--usefulness, reliability, validity, practicality, washback, authenticity, and transparency--and demonstrating how to apply them in creating an exam blueprint. The article also discusses the importance of providing feedback after a test has been…
Descriptors: Testing, Student Evaluation, Validity, Reliability
Berliner, David – Cambridge Journal of Education, 2011
The inevitable responses to high stakes testing, wherein students' test scores are highly consequential for teachers and administrators, include cheating, excessive test preparation, changes in test scoring and other forms of gaming to ensure that test scores appear high. Over the last decade this has been demonstrated convincingly in the USA, but…
Descriptors: Test Preparation, School Restructuring, Testing, Construct Validity
Barrueco, Sandra; Lopez, Michael; Ong, Christine; Lozano, Patricia – Brookes Publishing Company, 2012
As the population of young dual language learners continues to rise, how can early childhood professionals choose culturally and linguistically appropriate assessments for Spanish-English bilingual preschoolers? They'll get expert guidance in this one-of-a-kind resource, a comprehensive roundup and analysis of 37 developmental assessments…
Descriptors: Disabilities, Preschool Children, Psychometrics, English (Second Language)
Schmitt, Norbert; Ng, Janice Wun Ching; Garras, John – Language Testing, 2011
Although the Word Associates Format (WAF) is becoming more frequently used as a depth-of-knowledge measure, relatively little validation has been carried out on it. This report of two validation studies tackles various important WAF issues yet to be satisfactorily resolved. Study 1 conducted introspective interviews regarding students' WAF…
Descriptors: Scoring, Vocabulary Development, Associative Learning, Validity
Peer reviewedEssex, Diane L. – Journal of Medical Education, 1976
Two multiple-choice scoring schemes--a partial credit scheme and a dichotomous approach--were compared analyzing means, variances, and reliabilities on alternate measures and student reactions. Students preferred the partial-credit approach, which is recommended if rewarding for partial knowledge is an important concern. (Editor/JT)
Descriptors: Higher Education, Medical Students, Multiple Choice Tests, Reliability
Grenwelge, Cheryl H. – Journal of Psychoeducational Assessment, 2009
The Woodcock Johnson III Brief Assessment is a "maximum performance test" (Reynolds, Livingston, Willson, 2006) that is designed to assess the upper levels of knowledge and skills of the test taker using both power and speed to obtain a large amount of information in a short period of time. The Brief Assessment also provides an adequate…
Descriptors: Test Results, Knowledge Level, Testing, Performance Tests
Peer reviewedMadden, Theodore M. – Psychology in the Schools, 1974
In efforts to clarify ambiguity in the scoring directions for part of the WISC, 100 children were given the test which was scored by two sets of criteria. Four sets of data were analyzed, with significant discrepancies apparent only between Verbal-Performance correlations secured by Wechsler in his standardization of the WISC and those secured in…
Descriptors: Cognitive Measurement, Evaluation Criteria, Psychometrics, Scoring
Scholfield, Phil – 1995
This book is a guide to categorizing, measuring, testing, and assessing aspects of language, and is intended for language teachers, speech therapists and other language-related practitioners, and researchers, in conjunction with other resources on research methods and statistics. The first part is a discussion of basic terminology and the varied…
Descriptors: Data Collection, Language Proficiency, Language Skills, Language Tests
Haladyna, Thomas M. – Educational Horizons, 2006
This article argues that the validity of standardized achievement test-score interpretation and use is problematic; consequently, confidence and trust in such test scores may often be unwarranted. The problem is particularly severe in high-stakes situations. This essay provides a context for understanding standardized achievement testing, then…
Descriptors: Validity, Testing, Achievement Tests, Standardized Tests

Direct link
