Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 20 |
Descriptor
Scores | 33 |
Test Use | 33 |
Testing | 33 |
Test Validity | 11 |
Standardized Tests | 10 |
Test Interpretation | 8 |
Tests | 7 |
Validity | 7 |
Accountability | 6 |
Achievement Tests | 6 |
Elementary Secondary Education | 6 |
More ▼ |
Source
Author
Sireci, Stephen G. | 2 |
Amery D. Wu | 1 |
Bachman, Lyle F. | 1 |
Beckum, Leonard C. | 1 |
Camara, Wayne J. | 1 |
Dietel, Ron | 1 |
Erb, Tom | 1 |
Haertel, Edward H. | 1 |
Han, Kyung T. | 1 |
Harsch, Claudia | 1 |
Herndon, Enid B. | 1 |
More ▼ |
Publication Type
Education Level
Adult Education | 3 |
Elementary Education | 3 |
Elementary Secondary Education | 3 |
Adult Basic Education | 2 |
High School Equivalency… | 2 |
Secondary Education | 2 |
High Schools | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Teachers | 2 |
Administrators | 1 |
Community | 1 |
Parents | 1 |
Practitioners | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Every Student Succeeds Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Haertel, Edward H. – Educational Psychologist, 2018
In the service of educational accountability, student achievement tests are being used to measure constructs quite unlike those envisioned by test developers. Scores are compared to cut points to create classifications like "proficient"; scores are combined over time to measure growth; student scores are aggregated to measure the…
Descriptors: Achievement Tests, Scores, Test Validity, Test Interpretation
Shohamy, Elana – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2022
The paper reports on trends in language testing taking place over the years and aim at critical perspectives of testing and promoting inclusion, equity and justice. It begins with critical theories by Messick, Foucault and Bourdieu, leading to critical language testing (CLT) which focused on consequences and uses of tests. Given the power of tests…
Descriptors: Language Tests, Testing, Multilingualism, Social Justice
Florida Department of Education, 2020
This technical assistance paper provides policy and guidance to individuals with test administration responsibilities in adult education programs. The Florida assessment policies and guidelines presented in this paper are appropriate for state and federal reporting. Therefore, guidance and procedures regarding the selection and use of appropriate…
Descriptors: Technical Assistance, Adult Education, Students with Disabilities, Testing Accommodations
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Florida Department of Education, 2021
This technical assistance paper provides policy and guidance to individuals with test administration responsibilities in adult education programs. The Florida assessment policies and guidelines presented herein are appropriate for state and federal reporting. Therefore, guidance and procedures regarding the selection and use of appropriate student…
Descriptors: Adult Education, Educational Assessment, Educational Policy, Test Selection
Sireci, Stephen G. – Journal of Educational Measurement, 2013
Kane (this issue) presents a comprehensive review of validity theory and reminds us that the focus of validation is on test score interpretations and use. In reacting to his article, I support the argument-based approach to validity and all of the major points regarding validation made by Dr. Kane. In addition, I call for a simpler, three-step…
Descriptors: Validity, Theories, Test Interpretation, Test Use
Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019
This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…
Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
GED Testing Service, 2018
The manual is presented in the form of a policy grid. The grid includes a consolidated list of General Educational Development (GED) Testing Service policies regarding the GED® test and overall GED® program. The grid combines all of the policies into one unified table and supersedes any prior policy manual.
Descriptors: High School Equivalency Programs, Equivalency Tests, Testing Programs, Educational Policy
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
Harsch, Claudia – Language Assessment Quarterly, 2014
This article explores a number of key issues that emerged during the panel discussion that followed the General Language Proficiency Symposium at the Language Testing Forum (LTF) 2010, celebrating the 30th anniversary of the LTF. The key issues that emerged during the discussion should be of interest to a wider audience, as they express current…
Descriptors: Language Proficiency, Literacy, Language Tests, High Stakes Tests
Magee, Robert G.; Jones, Brett D. – Australian Journal of Educational & Developmental Psychology, 2012
This article describes the development of an instrument to assess beliefs about standardized testing in schools, a topic of much heated debate. The Beliefs About Standardized Testing scale was developed to measure the extent to which individuals support high-stakes standardized testing. The 9-item scale comprises three subscales which measure…
Descriptors: Testing, Measures (Individuals), Standardized Tests, Epistemology
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Mann, Wolfgang; Marshall, Chloe R. – International Journal of Bilingual Education and Bilingualism, 2010
In this article, we adapt a concept designed to structure language testing more effectively, the "Assessment Use Argument" ("AUA"), as a framework for the development and/or use of sign language assessments for deaf children who are taught in a sign bilingual education setting. By drawing on data from a recent investigation of…
Descriptors: Sign Language, Bilingual Education, Deafness, Language Tests