ERIC - Search Results

Source

Applied Psychological…	5
Education Policy Analysis…	5
Journal of Educational…	3
Education Statistics Quarterly	2
Educational and Psychological…	2
Intelligence	2
Applied Measurement in…	1
Educational Measurement:…	1
Psychometrika	1
What Works Clearinghouse	1

Publication Type

Book/Product Reviews	27
Journal Articles	22
Reports - Evaluative	7
Speeches/Meeting Papers	3
Opinion Papers	1

Education Level

Grade 8

Audience

Location

Florida

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	2
ACT Assessment	1

What Works Clearinghouse Rating

Book/Product Reviews X

Showing 1 to 15 of 27 results Save | Export

True Scores, Latent Variables, and Constructs: A Comment on Schmidt and Hunter.

Peer reviewed

Borsboom, Denny; Mellenbergh, Gideon J. – Intelligence, 2002

Makes the case that the arguments of F. Schmidt and J. Hunter in favor of the correction for attenuation in theory testing are based on mistaken assumptions. Outlines arguments against the routine use of correction for attenuation, focusing on the relationship between true scores and construct scores. (SLD)

Descriptors: Intelligence, Theories, True Scores

Reliability: Rejoinder to Thompson and Vacha-Haase.

Peer reviewed

Sawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000

B. Thompson and T. Vacha-Haase have examined the statement "the reliability of the test" with emphasis on the following three words: (1) the first "the"; (2) "test"; and (3) the second "the." This discussion focuses instead on the word "reliability." (Author)

Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability

Is Reliability Obsolete? A Commentary on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Collins, Linda M. – Applied Psychological Measurement, 1996

The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)

Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Measurement Error, Multidimensionality, and Scale Shrinkage: A Reply to Yen and Burket.

Peer reviewed

Camilli, Gregory – Journal of Educational Measurement, 1999

Yen and Burket suggested that shrinkage in vertical equating cannot be understood apart from multidimensionality. Reviews research on reliability, multidimensionality, and scale shrinkage, and explores issues of practical importance to educators. (SLD)

Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Reliability

BILOG 3 for Windows: Item Analysis and Test Scoring with Binary Logistic Models.

Peer reviewed

Kim, Seock-Ho – Applied Psychological Measurement, 1997

Reviews the most recent version of the BILOG computer program, which estimates item and trait level parameters for the one-, two-, and three-parameter logistic unidimensional Item Response Models for dichotomously scored data. Finds this version useful. (SLD)

Descriptors: Computer Software, Item Analysis, Item Response Theory, Scores

Advances in Performance Assessment Methodology.

Peer reviewed

Hambleton, Ronald K. – Applied Psychological Measurement, 2000

Introduces the articles of this theme issue focusing on performance assessment methodology. Papers address: (1) merging item formats; (2) scoring models; (3) equating and linking; (4) generalizability theory; (5) standard setting methods; and (6) validity issues and methods. (SLD)

Descriptors: Equated Scores, Evaluation Methods, Generalizability Theory, Performance Based Assessment

Skill Performance Comparability of Two Algebra Programs on an Eighth-Grade Population. What Works Clearinghouse Detailed Study Report

Peer reviewed
PDF on ERIC

Download full text

What Works Clearinghouse, 2004

Peters (1992) reports that students in the intervention and control groups showed gains on the Orleans-Hanna test during the course of the school year (that is, from pretest to posttest). However, the test score gains of the two groups did not differ significantly. There was no evidence that the Saxon Algebra curriculum (intervention) was more or…

Descriptors: Mathematics Achievement, Intervention, Scores, Control Groups

Orthogonal versus Oblique Factor Rotation: A Review of the Literature regarding the Pros and Cons.

Download full text

Kieffer, Kevin M. – 1998

Factor analysis has been characterized as being at the heart of the score validation process. In virtually all applications of exploratory factor analysis, factors are rotated to better meet L. Thurstone's simple structure criteria. Two major rotation strategies are available: orthogonal and oblique. This paper reviews the numerous rotation…

Descriptors: Factor Analysis, Heuristics, Literature Reviews, Oblique Rotation

Linear Dependence on Gain Scores in Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Grades or Scores: Predicting Future College Mathematics Performance.

Peer reviewed

Kessel, Cathy; Linn, Marcia C. – Educational Measurement: Issues and Practice, 1996

Studies of student performance on entrance examinations and in high school and college courses are reviewed, and the validity of entrance examinations is studied. Scores tend to underpredict grades of females relative to those of males in mathematics courses. Possible explanations and the reform of mathematics education are discussed. (SLD)

Descriptors: College Entrance Examinations, College Students, Grades (Scholastic), Higher Education

Validating Licensing and Certification Test Score Interpretations and Decisions: A Response.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 1997

This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)

Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment

Computerized Adaptive Testing: A Primer Book Review.

Peer reviewed

Andrich, David – Psychometrika, 1995

This book discusses adapting pencil-and-paper tests to computerized testing. Mention is made of models for graded responses to items and of possibilities beyond pencil-and-paper-tests, but the book is essentially about dichotomously scored test items. Contrasts between item response theory and classical test theory are described. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Scores

A Reaction to "Moderating Possibly Irrelevant Multiple Mean Score Differences on a Test of Mathematical Reasoning."

Peer reviewed

Luecht, Richard M. – Journal of Educational Measurement, 1998

Comments on the application of a proposed automated test assembly (ATA) to the problem of reducing potential performance differential among population subgroups and points out some pitfalls. Presents a rejoinder by M. Stocking and others. (SLD)

Descriptors: Automation, Computer Assisted Testing, Item Banks, Mathematics Tests

Ability Explorer: A Review and Critique.

Download full text

Hoffman, Anne – 1997

The Ability Explorer (AE) is a newly developed self-report inventory of abilities that is appropriate for group or individual administration. There are machine-scorable and hand-scorable versions of the test, and there are two levels. Level 1 is for students from junior high to high school, and Level 2 is for high school students and adults.…

Descriptors: Ability, Adolescents, Adults, Aptitude Tests

Previous Page | Next Page »

Pages: 1 | 2

Scores	24
Achievement Gains	10
Test Results	8
Academic Achievement	5
International Education	5
International Studies	5
Reliability	5
Test Use	5
Achievement Tests	4
Elementary Secondary Education	4
Mathematics Achievement	4
Performance Factors	4
Scoring	4
Test Theory	4
Aptitude Tests	3
Change	3
Computer Assisted Testing	3
Conservatism	3
Educational Assessment	3
Educational History	3
Educational Trends	3
Error of Measurement	3
Item Response Theory	3
Measurement Techniques	3
Political Attitudes	3
More ▼

Camilli, Gregory	2
Stedman, Lawrence C.	2
Andrich, David	1
Baker, Carl E.	1
Baker, David P.	1
Berliner, David C.	1
Biddle, Bruce J.	1
Borsboom, Denny	1
Breland, Hunter	1
Bulkley, Katrina	1
Collins, Linda M.	1
Cozzens, Margaret B.	1
Frisbie, David A.	1
Fuhrman, Susan H.	1
Hambleton, Ronald K.	1
Hoffman, Anne	1
Humphreys, Lloyd G.	1
Jensen, Arthur R.	1
Kessel, Cathy	1
Kieffer, Kevin M.	1
Kim, Seock-Ho	1
Kupermintz, Haggai	1
Lee, Yong-Won	1
Linn, Marcia C.	1
Luecht, Richard M.	1
More ▼