Descriptor
Source
Author
Camilli, Gregory | 2 |
Stedman, Lawrence C. | 2 |
Andrich, David | 1 |
Baker, Carl E. | 1 |
Baker, David P. | 1 |
Berliner, David C. | 1 |
Biddle, Bruce J. | 1 |
Borsboom, Denny | 1 |
Breland, Hunter | 1 |
Bulkley, Katrina | 1 |
Collins, Linda M. | 1 |
More ▼ |
Publication Type
Book/Product Reviews | 27 |
Journal Articles | 22 |
Reports - Evaluative | 7 |
Speeches/Meeting Papers | 3 |
Opinion Papers | 1 |
Education Level
Grade 8 | 1 |
Audience
Location
Florida | 2 |
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 2 |
ACT Assessment | 1 |
What Works Clearinghouse Rating

Borsboom, Denny; Mellenbergh, Gideon J. – Intelligence, 2002
Makes the case that the arguments of F. Schmidt and J. Hunter in favor of the correction for attenuation in theory testing are based on mistaken assumptions. Outlines arguments against the routine use of correction for attenuation, focusing on the relationship between true scores and construct scores. (SLD)
Descriptors: Intelligence, Theories, True Scores

Sawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000
B. Thompson and T. Vacha-Haase have examined the statement "the reliability of the test" with emphasis on the following three words: (1) the first "the"; (2) "test"; and (3) the second "the." This discussion focuses instead on the word "reliability." (Author)
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability

Collins, Linda M. – Applied Psychological Measurement, 1996
The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)
Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Camilli, Gregory – Journal of Educational Measurement, 1999
Yen and Burket suggested that shrinkage in vertical equating cannot be understood apart from multidimensionality. Reviews research on reliability, multidimensionality, and scale shrinkage, and explores issues of practical importance to educators. (SLD)
Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Reliability

Kim, Seock-Ho – Applied Psychological Measurement, 1997
Reviews the most recent version of the BILOG computer program, which estimates item and trait level parameters for the one-, two-, and three-parameter logistic unidimensional Item Response Models for dichotomously scored data. Finds this version useful. (SLD)
Descriptors: Computer Software, Item Analysis, Item Response Theory, Scores

Hambleton, Ronald K. – Applied Psychological Measurement, 2000
Introduces the articles of this theme issue focusing on performance assessment methodology. Papers address: (1) merging item formats; (2) scoring models; (3) equating and linking; (4) generalizability theory; (5) standard setting methods; and (6) validity issues and methods. (SLD)
Descriptors: Equated Scores, Evaluation Methods, Generalizability Theory, Performance Based Assessment
What Works Clearinghouse, 2004
Peters (1992) reports that students in the intervention and control groups showed gains on the Orleans-Hanna test during the course of the school year (that is, from pretest to posttest). However, the test score gains of the two groups did not differ significantly. There was no evidence that the Saxon Algebra curriculum (intervention) was more or…
Descriptors: Mathematics Achievement, Intervention, Scores, Control Groups
Kieffer, Kevin M. – 1998
Factor analysis has been characterized as being at the heart of the score validation process. In virtually all applications of exploratory factor analysis, factors are rotated to better meet L. Thurstone's simple structure criteria. Two major rotation strategies are available: orthogonal and oblique. This paper reviews the numerous rotation…
Descriptors: Factor Analysis, Heuristics, Literature Reviews, Oblique Rotation

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996
The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)
Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996
The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)
Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Kessel, Cathy; Linn, Marcia C. – Educational Measurement: Issues and Practice, 1996
Studies of student performance on entrance examinations and in high school and college courses are reviewed, and the validity of entrance examinations is studied. Scores tend to underpredict grades of females relative to those of males in mathematics courses. Possible explanations and the reform of mathematics education are discussed. (SLD)
Descriptors: College Entrance Examinations, College Students, Grades (Scholastic), Higher Education

Mehrens, William A. – Applied Measurement in Education, 1997
This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)
Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Andrich, David – Psychometrika, 1995
This book discusses adapting pencil-and-paper tests to computerized testing. Mention is made of models for graded responses to items and of possibilities beyond pencil-and-paper-tests, but the book is essentially about dichotomously scored test items. Contrasts between item response theory and classical test theory are described. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Scores

Luecht, Richard M. – Journal of Educational Measurement, 1998
Comments on the application of a proposed automated test assembly (ATA) to the problem of reducing potential performance differential among population subgroups and points out some pitfalls. Presents a rejoinder by M. Stocking and others. (SLD)
Descriptors: Automation, Computer Assisted Testing, Item Banks, Mathematics Tests
Hoffman, Anne – 1997
The Ability Explorer (AE) is a newly developed self-report inventory of abilities that is appropriate for group or individual administration. There are machine-scorable and hand-scorable versions of the test, and there are two levels. Level 1 is for students from junior high to high school, and Level 2 is for high school students and adults.…
Descriptors: Ability, Adolescents, Adults, Aptitude Tests
Previous Page | Next Page ยป
Pages: 1 | 2