Oliveri, María Elena; Rutkowski, David; Rutkowski, Lesli – ETS Research Report Series, 2018
Fifty years after the first international large-scale assessment (ILSA), participation in these studies continues to grow, with more than 50% of the world's countries participating. Concomitant with growth in ILSAs is an expansion in the diversity of participant countries with respect to languages, cultures, and educational perspectives and goals.…
Descriptors: International Assessment, Test Validity, Test Use, Alignment (Education)
Oliveri, María Elena; Nastal, Jessica; Slomp, David – ETS Research Report Series, 2020
This report discusses frameworks and assessment development approaches to consider fairness, opportunity to learn, and consequences of test use in the design and use of assessments administered to diverse populations. Examples include the integrated design and appraisal framework and the sociocognitively based evidence-centered design approach.…
Descriptors: Culture Fair Tests, Guidelines, Test Use, Test Construction
Zand Scholten, Annemarie – Measurement: Interdisciplinary Research and Perspectives, 2012
This paper presents the author's critique of Paul E. Newton's article titled "Clarifying the consensus definition of validity." In his article, Newton not only clarifies but also redefines the consensus definition of validity. In this redefinition he omits the term "construct" and introduces the term "measurement." Both omission and introduction…
Descriptors: Validity, Definitions, Evaluation, Test Use
Kane, Michael – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's article on the consensus definition of validity tackles a number of big issues and makes a number of strong claims. I agreed with much of what he said, and I disagreed with a number of his claims, but I found his article to be consistently interesting and thought provoking (whether I agreed or not). I will focus on three general…
Descriptors: Validity, Construct Validity, Tests, Testing
Jordan, Sally – Computers & Education, 2012
Students were observed directly, in a usability laboratory, and indirectly, by means of an extensive evaluation of responses, as they attempted interactive computer-marked assessment questions that required free-text responses of up to 20 words and as they amended their responses after receiving feedback. This provided more general insight into…
Descriptors: Learner Engagement, Feedback (Response), Evaluation, Test Interpretation
Zenisky, April L.; Crotts, Katrina M. – International Journal of Testing, 2010
The "International Journal of Testing" (IJT) is the journal of the International Test Commission. It is intended to support the dissemination of scholarly research on tests and test use worldwide. The purpose of this article is to reflect on what has been published in IJT over its nine volumes to date, with a focus on the extent to which…
Descriptors: Test Use, Testing, Evaluation, Tests
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2012
Considering consequences in the evaluation of validity is not new, although it is still debated by Paul E. Newton and others. The argument-based approach to validity entails an interpretative argument that explicitly identifies the proposed interpretations and uses of test scores and a validity argument that provides a structure for evaluating the…
Descriptors: Educational Opportunities, Accountability, Validity, Inferences
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Naugle, Kim A. – Measurement and Evaluation in Counseling and Development, 2009
This article discusses testing in counseling, the history of psychology's attempts to restrict access to testing, and the potential impact on the public. Counselors are encouraged to obtain appropriate training in assessment and to understand that testing is not only consistent with fair testing policies but also essential for ethical practice.…
Descriptors: State Legislation, Counseling, Testing, Evaluation
Green, Anthony – Language Assessment Quarterly, 2006
Previous studies of washback (the influence of a test on teaching and learning) have provided insights into the complexity of educational systems and test use, especially in relation to the role of the teacher, but have given insufficient attention to the relationship between observed practices and test design features. In this article a washback…
Descriptors: Test Use, Writing Tests, Testing, Language Tests
McNamara, Tim – Language Assessment Quarterly, 2006
The thought of Samuel Messick has influenced language testing in two main ways: in proposing a new understanding of how inferences made based on tests must be challenged, and in drawing attention to the consequences of test use. The former has had a powerful impact on language-testing research, most notably in Bachman's work on validity and the…
Descriptors: Test Use, Testing, Language Tests, Validity