NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)1
Since 2006 (last 20 years)6
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 17 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Peer reviewed Peer reviewed
Direct linkDirect link
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Peer reviewed Peer reviewed
Direct linkDirect link
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Peer reviewed Peer reviewed
Messick, Samuel – Educational Researcher, 1989
Presents a unified concept of test validity that integrates both the scientific and ethical considerations of test interpretation and use. Argues that the appropriateness, meaningfulness, and usefulness of score-based inferences are inseparable, and that this integration is based on construct validity. (FMW)
Descriptors: Construct Validity, Ethics, Scores, Social Influences
Peer reviewed Peer reviewed
Lord, Frederic M. – Journal of Educational Measurement, 1984
Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)
Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
Peer reviewed Peer reviewed
Tittle, Carol Kehr – Educational Measurement: Issues and Practice, 1989
An expanded framework for validating tests is needed to include the perspectives of teachers and students as well as of test makers and scientists. The development of educational assessments must take place within an understanding of how tests are used in context. (SLD)
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Utilization, Learning Processes
Ryan, Joseph M. – 1988
This monograph is designed for educators who must make decisions about curriculum, instruction, and the training of teachers. Many important educational decisions are made based primarily, if not exclusively, on test scores. This reliance on test scores has evolved for a variety of reasons. This monograph starts with test scores as a given in…
Descriptors: Criterion Referenced Tests, Elementary Education, Elementary School Mathematics, Evaluation Utilization
Peer reviewed Peer reviewed
Lohman, David F. – International Journal of Educational Research, 1997
A look at the history of intelligence testing suggests that those most closely allied with intelligence testing were often least able to see the larger issues. Input is needed from those who have examined broader currents in the history and sociology of ideas. New ideas must be cultivated to avoid redundancy in the field. (SLD)
Descriptors: Educational History, Educational Testing, Intelligence Tests, Political Influences
Haertel, Edward H. – 1992
Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…
Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory
Peer reviewed Peer reviewed
Traub, Ross E. – Alberta Journal of Educational Research, 1990
Describes five propositions concerning classroom assessment. Uses propositions to review seven conference papers. Propositions refer to the following: nature of achievement; student information necessary to interpret assessments; tension between need to describe and need to praise; distinction between formal and informal assessment; and need for…
Descriptors: Achievement, Educational Research, Evaluation Methods, Measures (Individuals)
Coffman, William E. – Executive Review, 1980
Standardized achievement tests are often misused as indicators of a school's quality or effectiveness relative to other schools. This is an incorrect use because it ignores variation among schools in student abilities, family support of education, student mobility, and other factors. People also misuse tests because they impute to them more…
Descriptors: Academic Ability, Achievement Tests, Criterion Referenced Tests, Educational Testing
Koretz, Daniel – American Educator: The Professional Journal of the American Federation of Teachers, 1988
Student test scores are increasingly used to judge the competence of the educational enterprise. Exaggeration of scores is the result of directing attention away from the individual student achievement to the average scores of schools, districts, and states. Implications and recommendations are discussed. (BJV)
Descriptors: Academic Achievement, Achievement Tests, Boards of Education, Educational Testing
Previous Page | Next Page ยป
Pages: 1  |  2