Publication Date
In 2025 | 10 |
Since 2024 | 32 |
Since 2021 (last 5 years) | 89 |
Since 2016 (last 10 years) | 198 |
Since 2006 (last 20 years) | 402 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 274 |
Researchers | 122 |
Teachers | 102 |
Administrators | 63 |
Counselors | 28 |
Parents | 21 |
Policymakers | 21 |
Students | 15 |
Community | 8 |
Location
Canada | 44 |
California | 33 |
Australia | 32 |
United Kingdom | 23 |
United States | 19 |
Pennsylvania | 18 |
United Kingdom (England) | 16 |
New York | 15 |
Michigan | 14 |
Japan | 13 |
New Jersey | 12 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Kane, Michael T. – Journal of Educational Measurement, 2013
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Descriptors: Test Interpretation, Validity, Scores, Test Use
Bennett, Randy Elliot – Teachers College Record, 2014
Background/Context: There is little question that education is changing, seemingly quickly and in some cases dramatically. The mechanisms through which individuals learn are shifting from paper-based ones to electronic media. Simultaneously, the nature of what individuals must learn is evolving, in good part due to an exponential accumulation of…
Descriptors: Educational Assessment, Educational Change, Evaluation, Evaluation Needs
Rix, Samantha – Journal on English Language Teaching, 2012
This paper examines the utilization of construct validity in formative assessment for classroom-based purposes. Construct validity pertains to the notion that interpretations are made by educators who analyze test scores during formative assessment. The purpose of this paper is to note the challenges that educators face when interpreting these…
Descriptors: Construct Validity, Formative Evaluation, Scores, Tests
Crawford, John R.; Garthwaite, Paul H.; Morrice, Nicola; Duff, Kevin – Psychological Assessment, 2012
Supplementary methods for the analysis of the Repeatable Battery for the Assessment of Neuropsychological Status are made available, including (a) quantifying the number of abnormally low Index scores and abnormally large differences exhibited by a case and accompanying this with estimates of the percentages of the normative population expected to…
Descriptors: Neurological Impairments, Cognitive Tests, Psychological Testing, Adults
Camara, Wayne J.; Shaw, Emily J. – Educational Measurement: Issues and Practice, 2012
The measurement community needs to better understand how to interact with the media to effectively disseminate important findings from educational testing efforts. To this end, the current paper will review media coverage of educational testing and related issues and elaborate on areas of concern and opportunities for improved communication…
Descriptors: Test Results, Educational Testing, Measurement, Information Dissemination
Clemens, Nathan H.; Davis, John L.; Simmons, Leslie E.; Oslund, Eric L.; Simmons, Deborah C. – Journal of Psychoeducational Assessment, 2015
Standardized measures are often used as an index of students' reading comprehension and scores have important implications, particularly for students who perform below expectations. This study examined secondary-level students' patterns of responding and the prevalence and impact of non-attempted items on a timed, group-administered,…
Descriptors: Secondary School Students, Performance Based Assessment, Multiple Choice Tests, Reading Comprehension
Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G. – Educational Assessment, 2011
This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
Descriptors: Computer Assisted Testing, Scoring, Test Interpretation, Equated Scores
Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012
In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…
Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability
Jin, Tan; Mak, Barley; Zhou, Pei – Language Testing, 2012
The fuzziness of assessing second language speaking performance raises two difficulties in scoring speaking performance: "indistinction between adjacent levels" and "overlap between scales". To address these two problems, this article proposes a new approach, "confidence scoring", to deal with such fuzziness, leading to "confidence" scores between…
Descriptors: Speech Communication, Scoring, Test Interpretation, Second Language Learning
Nolan, Meaghan M.; Beran, Tanya; Hecker, Kent G. – Statistics Education Research Journal, 2012
Students with positive attitudes toward statistics are likely to show strong academic performance in statistics courses. Multiple surveys measuring students' attitudes toward statistics exist; however, a comparison of the validity and reliability of interpretations based on their scores is needed. A systematic review of relevant electronic…
Descriptors: Student Attitudes, Statistics, Attitude Measures, Student Surveys
Gandy, Sandra E. – Reading & Writing Quarterly, 2013
With the increasing amount of testing taking place in classrooms, teachers may question how appropriate those assessments are for the growing numbers of English language learners (ELLs) in the United States. One of the assessment options for classroom teachers is the informal reading inventory (IRI), which is the most frequently used assessment…
Descriptors: Informal Reading Inventories, English Language Learners, Student Evaluation, Standardized Tests
Buschang, Rebecca E.; Chung, Gregory K. W. K.; Delacruz, Girlie C.; Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2012
The purpose of this study was to validate inferences about scores of one task designed to measure subject matter knowledge and three tasks designed to measure aspects of pedagogical content knowledge. Evidence for the validity of inferences was based on two expectations. First, if tasks were sensitive to expertise, we would find group differences.…
Descriptors: Validity, Measures (Individuals), Test Interpretation, Algebra
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
O'Reilly, Tenaha; Sabatini, John – ETS Research Report Series, 2013
This paper represents the third installment of the Reading for Understanding (RfU) assessment framework. This paper builds upon the two prior installments (Sabatini & O'Reilly, 2013; Sabatini, O'Reilly, & Deane, 2013) by discussing the role of performance moderators in the test design and how scenario-based assessment can be used as a tool…
Descriptors: Reading Comprehension, Reading Tests, Test Construction, Student Characteristics
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems