Publication Date
| In 2026 | 0 |
| Since 2025 | 17 |
| Since 2022 (last 5 years) | 74 |
| Since 2017 (last 10 years) | 189 |
| Since 2007 (last 20 years) | 384 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 274 |
| Researchers | 122 |
| Teachers | 102 |
| Administrators | 63 |
| Counselors | 28 |
| Parents | 21 |
| Policymakers | 21 |
| Students | 15 |
| Community | 8 |
Location
| Canada | 45 |
| Australia | 33 |
| California | 33 |
| United Kingdom | 23 |
| United States | 20 |
| Pennsylvania | 18 |
| United Kingdom (England) | 17 |
| New York | 15 |
| Japan | 14 |
| Michigan | 14 |
| New Jersey | 12 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Davies, Alan – Language Testing, 2012
In this article, the author begins by discussing four challenges on the concept of validity. These challenges are: (1) the appeal to logic and syllogistic reasoning; (2) the claim of reliability; (3) the local and the universal; and (4) the unitary and the divisible. In language testing validity cannot be achieved directly but only through a…
Descriptors: Language Tests, Test Validity, Test Reliability, Testing
Jordan, Sally – Computers & Education, 2012
Students were observed directly, in a usability laboratory, and indirectly, by means of an extensive evaluation of responses, as they attempted interactive computer-marked assessment questions that required free-text responses of up to 20 words and as they amended their responses after receiving feedback. This provided more general insight into…
Descriptors: Learner Engagement, Feedback (Response), Evaluation, Test Interpretation
Canivez, Gary L.; Kush, Joseph C. – Journal of Psychoeducational Assessment, 2013
Weiss, Keith, Zhu, and Chen (2013a) and Weiss, Keith, Zhu, and Chen (2013b), this issue, report examinations of the factor structure of the Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) and Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV), respectively; comparing Wechsler Hierarchical Model (W-HM) and…
Descriptors: Intelligence Tests, Factor Structure, Comparative Analysis, Arithmetic
Knaster, Cara A.; Micucci, Joseph A. – Assessment, 2013
Client ethnicity has been shown to affect clinicians' diagnostic impressions. However, it is not known whether interpretation of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) clinical scales is affected by ethnic bias. In this study, clinicians (82 males, 60 females) provided severity ratings for six symptoms based on three MMPI-2…
Descriptors: Personality Measures, Ethnicity, Racial Bias, Test Interpretation
January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016
Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…
Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use
Runnels, Judith – Taiwan Journal of TESOL, 2016
Since its release in 1979 the TOEIC® (Test of English for International Communication) has been consistently and widely used by educational institutions and companies of Japan despite criticisms that it provides little useable information about language ability. In order to both reduce the extreme focus on and also aid with the practical…
Descriptors: Foreign Countries, English Curriculum, Language Tests, Second Language Learning
Michaelides, Michalis P. – Assessment in Education: Principles, Policy & Practice, 2014
Student examinees are key stakeholders in large-scale, high-stakes, public examination systems. How they perceive the purpose, comprehend the technical characteristics of testing and how they interpret scores influence their response to the system demands and their preparation for the examinations; this information relates to intended and…
Descriptors: Foreign Countries, National Competency Tests, High Stakes Tests, Student Attitudes
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Bennett, Randy Elliot – Teachers College Record, 2014
Background/Context: There is little question that education is changing, seemingly quickly and in some cases dramatically. The mechanisms through which individuals learn are shifting from paper-based ones to electronic media. Simultaneously, the nature of what individuals must learn is evolving, in good part due to an exponential accumulation of…
Descriptors: Educational Assessment, Educational Change, Evaluation, Evaluation Needs
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Crawford, John R.; Garthwaite, Paul H.; Morrice, Nicola; Duff, Kevin – Psychological Assessment, 2012
Supplementary methods for the analysis of the Repeatable Battery for the Assessment of Neuropsychological Status are made available, including (a) quantifying the number of abnormally low Index scores and abnormally large differences exhibited by a case and accompanying this with estimates of the percentages of the normative population expected to…
Descriptors: Neurological Impairments, Cognitive Tests, Psychological Testing, Adults
Camara, Wayne J.; Shaw, Emily J. – Educational Measurement: Issues and Practice, 2012
The measurement community needs to better understand how to interact with the media to effectively disseminate important findings from educational testing efforts. To this end, the current paper will review media coverage of educational testing and related issues and elaborate on areas of concern and opportunities for improved communication…
Descriptors: Test Results, Educational Testing, Measurement, Information Dissemination
Clemens, Nathan H.; Davis, John L.; Simmons, Leslie E.; Oslund, Eric L.; Simmons, Deborah C. – Journal of Psychoeducational Assessment, 2015
Standardized measures are often used as an index of students' reading comprehension and scores have important implications, particularly for students who perform below expectations. This study examined secondary-level students' patterns of responding and the prevalence and impact of non-attempted items on a timed, group-administered,…
Descriptors: Secondary School Students, Performance Based Assessment, Multiple Choice Tests, Reading Comprehension
Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012
In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…
Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability
Jin, Tan; Mak, Barley; Zhou, Pei – Language Testing, 2012
The fuzziness of assessing second language speaking performance raises two difficulties in scoring speaking performance: "indistinction between adjacent levels" and "overlap between scales". To address these two problems, this article proposes a new approach, "confidence scoring", to deal with such fuzziness, leading to "confidence" scores between…
Descriptors: Speech Communication, Scoring, Test Interpretation, Second Language Learning

Peer reviewed
Direct link
