Showing 1 to 15 of 17 results
Peer reviewed
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Peer reviewed
Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010
Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010
This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…
Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis
Peer reviewed
Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010
Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…
Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics
Peer reviewed
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Peer reviewed
Messick, Samuel – Educational Researcher, 1989
Presents a unified concept of test validity that integrates both the scientific and ethical considerations of test interpretation and use. Argues that the appropriateness, meaningfulness, and usefulness of score-based inferences are inseparable, and that this integration is based on construct validity. (FMW)
Descriptors: Construct Validity, Ethics, Scores, Social Influences
Kamil, Michael S.; Tierney, Robert J. – Illinois Schools Journal, 1988
In conjunction with testing mandates, some states have developed new measures intended to reflect changes in thinking about reading. Discusses, in dialogue form, whether these new measures support educational improvement or limit it. (BJV)
Descriptors: Educational Assessment, Educational Improvement, Reading Tests, Scores
Peer reviewed
Mitchell, James V., Jr. – Applied Measurement in Education, 1988
Applications of Oscar K. Buros' values and convictions to current developments in measurement are considered. Biographical information and Buros' personal philosophy on applied measurement are discussed. The Buros tradition refocuses evaluators' attention on the implications of their work for the end users of measurement results--test users and…
Descriptors: Computer Assisted Testing, Educational Assessment, Educational Philosophy, Educational Researchers
Sirotnik, Kenneth A. – 1979
The thesis of this paper is that the decision to use one of three approaches to unit-of-analysis in educational research should be based on substantive considerations, not statistical factors. In addition to the commonly used "total analysis" (regression analysis across individuals), the within and between analyses are inherent in the…
Descriptors: Classroom Environment, Correlation, Educational Research, Interaction
Peer reviewed
Whitehead, Bruce; Santee, Phillip – Clearing House, 1987
Discusses the use of standardized test results as a guide to developing curriculum content and uses data gathered at Hellgate Elementary School, Montana, as an example. (JC)
Descriptors: Criterion Referenced Tests, Curriculum Development, Educational Research, Elementary Education
Peer reviewed
Whitely, Susan E. – Intelligence, 1980
This article examines the potential contribution of latent trait models to the study of intelligence. Nontechnical introductions to both unidimensional and multidimensional latent trait models are given. Multidimensional latent trait models can be used to test alternative multiple component theories of test item processing. (Author/CTM)
Descriptors: Ability, Aptitude Tests, Cognitive Processes, Intelligence
Peer reviewed
Whitehead, Bruce; Santee, Phillip – Clearing House, 1994
Discusses the use of standardized test results as a guide to developing curriculum content. Describes such a plan in use at Hellgate Elementary School, Montana, and offers data gathered there as an example. (JC)
Descriptors: Criterion Referenced Tests, Curriculum Development, Educational Research, Elementary Education
Haertel, Edward H. – 1992
Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…
Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory
Shrock, Sharon; And Others – Performance and Instruction, 1986
Presents major stages in design and development of criterion referenced tests (CRT) with emphasis on differences between CRT construction and norm-referenced test construction. Discussion covers test interpretation; test theory; preparation for test construction (hierarchical analysis, item type selection, and choosing number of items); test…
Descriptors: Adoption (Ideas), Comparative Analysis, Criterion Referenced Tests, Industrial Training
Peer reviewed
Lawrence, Carol W. – Language, Speech, and Hearing Services in Schools, 1992
This article reviews the concept and derivation of age-equivalent scores as evidence for speech-language deficits. It presents arguments against the use of age-equivalent scores in case management decisions and recommends that, when used, a report include a clear explanation of their true meaning and other summaries of test performance. (Author/DB)
Descriptors: Age Differences, Clinical Diagnosis, Elementary Secondary Education, Evaluation Methods