Showing 1 to 15 of 54 results
Peer reviewed
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2013
In Shepard's (1997) discussion on the importance of test use and consequences in a validity argument for educational assessments, she reflected on Cronbach and Meehl's (1955) perspective on the role of test developers in providing consequential evidence. In the following year, a special issue in "Educational Measurement: Issues and Practice"…
Descriptors: Educational Testing, Test Use, Test Results, Test Validity
Peer reviewed
Bachman, Lyle – Measurement: Interdisciplinary Research and Perspectives, 2013
At the outset of his thoughtful and thought-provoking article, Haertel (this issue) clearly identifies the issue with which he will be dealing: The disjunct, or gap, in current approaches to evaluating the merits of a given test, between the intended uses of that test and the validity of its score-based interpretations. The author thinks that…
Descriptors: Educational Testing, Test Use, Test Validity, Test Interpretation
Peer reviewed
Engelhard, George, Jr.; Wind, Stefanie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In 2012, Edward Haertel received the NCME Career Contributions Award. The focus article for this issue emerged from his address on the topic "How Is Testing Supposed to Improve Schooling?" His focus article provides a discussion of the relationships between testing and schooling in which he issues a call to action to the measurement community to…
Descriptors: Educational Testing, Educational Improvement, Social Action, Test Results
Peer reviewed
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
The author is deeply gratified by the commentators' thoughtful responses and finds almost nothing to disagree with in any of them. Each offers additional insights prompting further reflection. In drawing out just a few common themes, this brief rejoinder omits many important ideas from the individual contributions. As stated in his title, the…
Descriptors: Educational Testing, Educational Improvement, Test Interpretation, Test Use
Peer reviewed
Zou, Shen; Xu, Qian – Language Assessment Quarterly, 2017
Washback and fairness are interrelated in validity research, and thus an investigation into washback inevitably involves fairness. This article reports Phase One of a washback study of "Test for English Majors for Grade Eight" (TEM8). Phase One was a questionnaire survey administered to university program administrators. Two research…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Test Bias
New York State Education Department, 2018
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2018 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Peer reviewed
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
New York State Education Department, 2017
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 English Language Arts (ELA) and Mathematics 2017 Operational Tests. This report includes information about test content and test development, item (i.e., individual…
Descriptors: English, Language Arts, Language Tests, Mathematics Tests
Peer reviewed
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Peer reviewed
Haertel, Edward – Measurement: Interdisciplinary Research and Perspectives, 2013
Validation research for educational achievement tests is often limited to an examination of intended test score interpretations. This article calls for an expansion of validation research in three dimensions. First, validation must attend to actual test use and its consequences, not just score meaning. Second, validation must attend to unintended…
Descriptors: Educational Testing, Educational Improvement, Test Validity, Achievement Tests
Peer reviewed
Falk, Beverly; Ort, Suzanne Wichterle; Moirs, Katie – Educational Assessment, 2007
This article describes the findings of studies conducted on a large-scale, classroom-based performance assessment of literacy for the early grades designed to provide information that is useful for reporting, as well as teaching. Technical studies found the assessment to be a promising instrument that is reliable and valid. Follow-up studies of…
Descriptors: Program Effectiveness, Performance Based Assessment, Student Evaluation, Evaluation Research
Peer reviewed
Arbisi, Paul A.; Ben-Porath, Yossef S. – Psychological Assessment, 1995
The development and initial validation of a new Minnesota Multiphasic Personality Inventory-2 (MMPI-2) scale designed to determine infrequent responding with psychopathological populations are described. Results with 1,179 subjects show that the Infrequency-Psychopathology Scale (F(p)) may be useful in settings with high base rates of…
Descriptors: Patients, Psychological Patterns, Psychopathology, Responses
Peer reviewed
Messick, Samuel – Educational Researcher, 1981
Argues for appraising tests for evidence of construct validity as well as evaluating the potential social consequences of test use. Asserts that construct validity provides a rational approach for predictive hypotheses and a rational basis for judgment of test relevance to the criterion domain. (Author/JCD)
Descriptors: Ethics, Evaluation Criteria, Scores, Test Construction
Fitzpatrick, Anne R. – 1981
Three kinds of classificatory decisions that might be made using criterion-referenced tests (CRTs) are described, and methods to appraise the validity of each are subsequently discussed. Decisions that entail predictive, descriptive, and evaluative classifications comprised the three kinds of decisions described. Predictive classifications entail…
Descriptors: Classification, Competence, Criterion Referenced Tests, Cutting Scores
Smith, Nancy J. – 1982
A school system's testing program can be used as a tool in curriculum development and instructional improvement if the tests match the goals and objectives of the instructional program and what is taught in the classroom. Test-taking skills should be taught so that the test will accurately reflect certain knowledge. Test results should be an…
Descriptors: Curriculum Development, Elementary Secondary Education, Instructional Design, Instructional Development