NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)1
Since 2007 (last 20 years)12
What Works Clearinghouse Rating
Showing 46 to 60 of 93 results Save | Export
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Peer reviewed Peer reviewed
Linn, Robert L. – Educational Measurement: Issues and Practice, 1982
Confusion in the terminology used in criterion-referenced measurement specifications and development and standard setting and the attendant role of cut-off scores are shown to need practical clarification through psychometric research on test applications and consequences. (CM)
Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Measurement Objectives
Wilcox, Rand R. – 1981
These studies in test adequacy focus on two problems: procedures for estimating reliability, and techniques for identifying ineffective distractors. Fourteen papers are presented on recent advances in measuring achievement (a response to Molenaar); "an extension of the Dirichlet-multinomial model that allows true score and guessing to be…
Descriptors: Achievement Tests, Criterion Referenced Tests, Guessing (Tests), Mathematical Models
Powell, J. C. – 1980
Current Scoring practices for multiple-choice tests are rooted in early Associationist Theory and are based on a two-step procedure: (1) right answers counted as ones and wrong answers are zeros, and (2) number of right answers form a total-correct score. The author contends that if either step is invalid, the use of the general linear model (GLM)…
Descriptors: Elementary Secondary Education, Higher Education, Logical Thinking, Multiple Choice Tests
Peer reviewed Peer reviewed
Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983
Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…
Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis
Peer reviewed Peer reviewed
Airaisian, Peter W. – International Journal of Educational Research, 1997
This issue presents examinations of educational testing, large-scale alternative assessment, small-scale alternative assessment, and educational measurement. These discussions go beyond technical issues to provide a conceptual perspective and a view of underlying histories, theories, applications, and the uncertainties associated with these…
Descriptors: Alternative Assessment, Educational Assessment, Educational Change, Educational Testing
Jones, Patricia B.; Sabers, Darrell L. – 1984
Several techniques have been developed for creating continuous smooth distributions of test norms. This paper describes two studies that explore the behavior of cubic splines in order to determine their appropriateness for use in test norming. The first study uses data from the Curriculum Referenced Tests of Mastery (CRTM) and employs two…
Descriptors: Equated Scores, Goodness of Fit, Measurement Techniques, Norm Referenced Tests
Peer reviewed Peer reviewed
Lohman, David F. – International Journal of Educational Research, 1997
A look at the history of intelligence testing suggests that those most closely allied with intelligence testing were often least able to see the larger issues. Input is needed from those who have examined broader currents in the history and sociology of ideas. New ideas must be cultivated to avoid redundancy in the field. (SLD)
Descriptors: Educational History, Educational Testing, Intelligence Tests, Political Influences
Peer reviewed Peer reviewed
Glaser, Robert – Educational Measurement: Issues and Practice, 1994
Some unfinished issues relating to achievement test theory that seemed implicit in the basic idea of criterion-referenced testing are reviewed, recognizing their importance in current studies of authentic assessment and performance-based tests. The future of performance-based evaluation is explored. (SLD)
Descriptors: Academic Achievement, Achievement Tests, Criterion Referenced Tests, Educational History
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Sullivan, Francis J. – 1987
To examine "bluffing"--ways in which conflicts in classrooms and evaluation procedures influence the styles of student writing and teachers' responses to different styles, a study analyzed the placement-test essays of 99 undergraduates entering Temple University (Pennsylvania) in the fall of 1982. Analysis of the texts was based on a…
Descriptors: Constructed Response, Essay Tests, Higher Education, Response Style (Tests)
Peer reviewed Peer reviewed
Altepeter, Tom – School Psychology Review, 1983
A critical review of the Expressive One-Word Picture Vocabulary Test (Gardner) is offered. The reviewer feels that the instrument cannot be recommended in its present form. Further research concerning the manual, and theoretical issues, (particularly test-retest stability) is strongly recommended. (Author/PN)
Descriptors: Error of Measurement, Intelligence Tests, Item Analysis, Pictorial Stimuli
Mayberry, Paul W. – 1984
Efforts to study the fidelity of translation of attitudinal scales into foreign languages have faltered due to the lack of powerful statistical tests to assess such transformations. This study uses a maximum likelihood factor analysis procedure to compare multivariate factor structures across subpopulations. The results showed that inconsistent…
Descriptors: Adults, Attitude Measures, Factor Analysis, Factor Structure
Peer reviewed Peer reviewed
Rennie, Leonie J.; Parker, Lesley H. – Journal of Research in Science Teaching, 1987
Examines some of the problems in interpreting data from attitude measures when the dimensionality of the instrument and sources of heterogeneity in the population are ignored. Illustrates this point by describing the development of an instrument designed to measure children's attitude toward science-related activities. (TW)
Descriptors: Attitude Measures, Attitudes, Elementary Education, Elementary School Science
Peer reviewed Peer reviewed
Direct linkDirect link
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7