Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 11 |
Descriptor
Test Results | 54 |
Test Use | 54 |
Test Validity | 54 |
Test Interpretation | 18 |
Scores | 15 |
Test Reliability | 15 |
Educational Testing | 13 |
Test Construction | 13 |
Elementary Secondary Education | 12 |
Testing Programs | 12 |
Achievement Tests | 11 |
More ▼ |
Source
Author
Haertel, Edward | 2 |
Isonio, Steven | 2 |
Lane, Suzanne | 2 |
Aikenhead, Glen S. | 1 |
Appleby, Judith A. | 1 |
Arbisi, Paul A. | 1 |
Bachman, Lyle | 1 |
Baker, Eva L. | 1 |
Ben-Porath, Yossef S. | 1 |
Bridgeman, Brent | 1 |
Bullock, Cheryl Davis | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 6 |
Elementary Education | 3 |
Early Childhood Education | 2 |
Grade 3 | 2 |
Grade 4 | 2 |
Grade 5 | 2 |
Grade 6 | 2 |
Grade 7 | 2 |
Grade 8 | 2 |
Intermediate Grades | 2 |
Junior High Schools | 2 |
More ▼ |
Audience
Practitioners | 9 |
Parents | 4 |
Students | 2 |
Teachers | 2 |
Community | 1 |
Laws, Policies, & Programs
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2013
In Shepard's (1997) discussion on the importance of test use and consequences in a validity argument for educational assessments, she reflected on Cronbach and Meehl's (1955) perspective on the role of test developers in providing consequential evidence. In the following year, a special issue in "Educational Measurement: Issues and Practice"…
Descriptors: Educational Testing, Test Use, Test Results, Test Validity
Bachman, Lyle – Measurement: Interdisciplinary Research and Perspectives, 2013
At the outset of his thoughtful and thought-provoking article, Haertel (this issue) clearly identifies the issue with which he will be dealing: The disjunct, or gap, in current approaches to evaluating the merits of a given test, between the intended uses of that test and the validity of its score-based interpretations. The author thinks that…
Descriptors: Educational Testing, Test Use, Test Validity, Test Interpretation
Engelhard, George, Jr.; Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In 2012, Edward Haertel received the NCME Career Contributions Award. The focus article for this issue emerged from his address on the topic "How Is Testing Supposed to Improve Schooling?" His focus article provides a discussion of the relationships between testing and schooling in which he issues a call to action to the measurement community to…
Descriptors: Educational Testing, Educational Improvement, Social Action, Test Results
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
The author is deeply gratified by the commentators' thoughtful responses and finds almost nothing to disagree with in any of them. Each offers additional insights prompting further reflection. In drawing out just a few common themes, this brief rejoinder omits many important ideas from the individual contributions. As stated in his title, the…
Descriptors: Educational Testing, Educational Improvement, Test Interpretation, Test Use
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
Validation research for educational achievement tests is often limited to an examination of intended test score interpretations. This article calls for an expansion of validation research in three dimensions. First, validation must attend to actual test use and its consequences, not just score meaning. Second, validation must attend to unintended…
Descriptors: Educational Testing, Educational Improvement, Test Validity, Achievement Tests
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research

Arbisi, Paul A.; Ben-Porath, Yossef S. – Psychological Assessment, 1995
The development and initial validation of a new Minnesota Multiphasic Personality Inventory--2 (MMPI-2) scale designed to determine infrequent responding with psychopathological populations are described. Results with 1,179 subjects show that the Infrequency-Psychopathology Scale (F p ) may be useful in settings with high base rates of…
Descriptors: Patients, Psychological Patterns, Psychopathology, Responses

Messick, Samuel – Educational Researcher, 1981
Argues for appraising tests for evidence of construct validity as well as evaluating the potential social consequences of test use. Asserts that construct validity provides a rational approach for predictive hypotheses and a rational basis for judgment of test relevance to the criterion domain. (Author/JCD)
Descriptors: Ethics, Evaluation Criteria, Scores, Test Construction
Fitzpatrick, Anne R. – 1981
Three kinds of classificatory decisions that might be made using criterion-referenced tests (CRTs) are described, and methods to appraise the validity of each are subsequently discussed. Decisions that entail predictive, descriptive, and evaluative classifications comprised the three kinds of decisions described. Predictive classifications entail…
Descriptors: Classification, Competence, Criterion Referenced Tests, Cutting Scores
Smith, Nancy J. – 1982
A school system's testing program can be used as a tool in curriculum development and instructional improvement if the tests match the goals and objectives of the instructional program and what is taught in the classroom. Test-taking skills should be taught so that the test will accurately reflect certain knowledge. Test results should be an…
Descriptors: Curriculum Development, Elementary Secondary Education, Instructional Design, Instructional Development