NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Yen, Wendy M. – 1984
Two of the most popular methods for obtaining equal-interval scales for educational measurement are discussed: Thurstone's method and Item Response Theory (IRT). Between-grade growth on these scales is compared; while unstandardized differences show different trends for the two scales, standardized differences that take standard deviations into…
Descriptors: Academic Achievement, Achievement Tests, Educational Research, Latent Trait Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Luecht, Richard M. – Foreign Language Annals, 2003
This article contends that the necessary links between constructs and test scores/decisions in language assessment must be established through principled design procedures that align three models: (1) a theoretical construct model; (2) a test development model; and (3) a psychometric scoring model. The theoretical construct model articulates the…
Descriptors: Scoring, Psychometrics, Language Proficiency, Language Tests
Peer reviewed Peer reviewed
Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Peer reviewed Peer reviewed
Messick, Samuel – American Psychologist, 1995
Presents a comprehensive review of validity that includes an empirical evaluation of the actual and potential consequences of score interpretation and use, how those consequences come about, and what determines them. Six distinguishable aspects of construct validity are highlighted as a means of addressing central issues implicit in the notion of…
Descriptors: Concurrent Validity, Construct Validity, Content Validity, Models
Haertel, Edward H. – 1992
Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…
Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory
Peer reviewed Peer reviewed
Wilson, John; Haugh, Brian – Language and Education, 1995
Argues that the method of "collaborative modelling" developed to teach reading skills may be utilized in generating and assessing pupil talk within the classroom. Pupil pairs were given different texts from science, English, and geography and asked to re-present them in another form. Results indicate the value of the talk emerging from…
Descriptors: Case Studies, Class Activities, Classroom Communication, Cooperation
Rock, Donald A. – 1991
Issues in the development of assessments of higher order thinking skills for college graduates are discussed in the order in which they were presented when this series of papers was commissioned. With regard to Issue 1, it is generally agreed that the development of these skills is a desirable goal, but there is little consensus on how they should…
Descriptors: Adult Literacy, Cognitive Measurement, College Graduates, Communication Skills