NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 481 to 495 of 1,161 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Zimmerman, Donald W.; Williams, Richard H.; Zumbo, Bruno D.; Ross, Donald – International Journal of Testing, 2005
This article focuses on Louis Guttman's contributions to the classical theory of educational and psychological tests, one of the lesser known of his many contributions to quantitative methods in the social sciences. Guttman's work in this field provided a rigorous mathematical basis for ideas that, for many decades after Spearman's initial work,…
Descriptors: Evaluation Methods, Test Theory, Social Sciences, Psychological Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005
Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…
Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Reeve, Charlie L.; Lam, Holly – Intelligence, 2005
The simple practice effects commonly observed when retaking general cognitive ability tests present a potential paradox. If observed score changes reflect real changes in g, we must revisit our understanding of its stability. Conversely, if observed score changes reflect something other than a true change in the underlying latent construct, this…
Descriptors: Psychometrics, Cognitive Ability, Cognitive Measurement, Test Theory
Liu, Kimy; Sundstrom-Hebert, Krystal; Ketterlin-Geller, Leanne R.; Tindal, Gerald – Behavioral Research and Teaching, 2008
The purpose of this study was to document the instrument development of maze measures for grades 3-8. Each maze passage contained twelve omitted words that students filled in by choosing the best-fit word from among the provided options. In this technical report, we describe the process of creating, reviewing, and pilot testing the maze measures.…
Descriptors: Test Construction, Cloze Procedure, Multiple Choice Tests, Reading Tests
Brown, James Dean; Ross, Jacqueline A. – 1993
This study investigates the Test of English as a Foreign Language (TOEFL), in particular the relative contributions to score dependability (analogous to classical theory reliability) of various numbers of items and subtests as well as the decision dependability at different cut points. Research questions that apply to the overall TOEFL battery and…
Descriptors: English (Second Language), Language Tests, Statistical Analysis, Test Reliability
Stone, Kathy Kees; And Others – 1983
Looking beyond the overall effectiveness of sensory stimulation, this study aimed to identify specific aspects of infant behavior most responsive to early stimulation. Subjects were 65 premature infants with a birth weight of less than 5 pounds, 8 ounces and a gestational age under 37 weeks. Experimental group members had completed a multimodal…
Descriptors: Comparative Analysis, Discriminant Analysis, Infant Behavior, Premature Infants
Wainer, Howard – 1982
This paper is the transcript of a talk given to those who use test information but who have little technical background in test theory. The concepts of modern test theory are compared with traditional test theory, as well as a probable future test theory. The explanations given are couched within an extended metaphor that allows a full description…
Descriptors: Difficulty Level, Latent Trait Theory, Metaphors, Test Items
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability
Budescu, David V. – 1979
This paper outlines a technique for differentially weighting options of a multiple choice test in a fashion that maximizes the item predictive validity. The rule can be applied with different number of categories and the "optimal" number of categories can be determined by significance tests and/or through the R2 criterion. Our theoretical analysis…
Descriptors: Multiple Choice Tests, Predictive Validity, Scoring Formulas, Test Items
Peer reviewed Peer reviewed
Albanese, Mark; Pfohl, Bruce – Evaluation and the Health Professions, 1988
A procedure derived from classical test theory analyzes course grades and report results to assess third-year clerkships at a midwestern medical school. The procedure is sensitive to a large range of characteristics of the courses and is a promising supplement to student course evaluations in studying curriculum change. (SLD)
Descriptors: Course Content, Course Evaluation, Curriculum Development, Curriculum Evaluation
Peer reviewed Peer reviewed
O'Brien, Michael L. – Studies in Educational Evaluation, 1986
A monograph issue on the development and use of a prescriptive measurement method is introduced. Given such a measurement system, it is possible to investigate both level and pattern of a student's performance, and to diagnose specific gaps in learning. (LMO)
Descriptors: Academic Achievement, Educational Diagnosis, Educational Testing, Elementary Secondary Education
Peer reviewed Peer reviewed
Jarjoura, David – Journal of Educational Statistics, 1985
Issues regarding tolerance and confidence intervals are discussed within the context of educational measurement, and conceptual distinctions are drawn between these two types of intervals. Points are raised about the advantages of tolerance intervals when the focus is on a particular observed score rather than a particular examinee. (Author/BW)
Descriptors: Comparative Analysis, Error of Measurement, Mathematical Models, Test Interpretation
Peer reviewed Peer reviewed
Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984
This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)
Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing
Peer reviewed Peer reviewed
Rost, Jurgen – Psychometrika, 1985
A latent class model for rating data is presented which provides an alternative to the latent trait approach of analyzing test data. It is the analog of Andrich's binomial Rasch model for Lazarsfeld's latent class analysis (LCA). Response probabilities for rating categories follow a binomial distribution and depend on class-specific item…
Descriptors: Item Analysis, Latent Trait Theory, Mathematical Models, Rating Scales
Peer reviewed Peer reviewed
Spencer, Bruce D. – Journal of Educational Measurement, 1983
Because test scores are ordinal not cordinal attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…
Descriptors: Correlation, Measurement Techniques, Scores, Statistical Analysis
Pages: 1  |  ...  |  29  |  30  |  31  |  32  |  33  |  34  |  35  |  36  |  37  |  ...  |  78