ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	1

Source

Educational Measurement:…	1
Educational Researcher	1
Executive Review	1
International Association for…	1

Publication Type

Speeches/Meeting Papers	21
Reports - Evaluative	9
Reports - Research	9
Opinion Papers	4
Journal Articles	2
Collected Works - Serials	1
Reports - Descriptive	1

Education Level

Audience

Researchers

Location

Canada

Laws, Policies, & Programs

Assessments and Surveys

Kaufman Assessment Battery…	1
National Teacher Examinations	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

An Alternative Approach to Test Analysis and Interpretation

Download full text

Powell, J. C. – International Association for Development of the Information Society, 2013

This reflection paper challenges current test scoring practices on the grounds that most wrong-answer selections are thoughtful not random, presenting research supporting this proposition. An alternative test scoring system is presented, described and its outcomes discussed. This new scoring system increases the number of variables considered,…

Descriptors: Test Theory, Test Interpretation, Scoring, Multiple Choice Tests

Revised Thinking about the Nature of Score Validity.

Download full text

Woolley, Kristin K. – 1996

The theory of score validity has undergone several revisions within the measurement community. The current consensus among professionals is a rejection of the trinitarian doctrine (J. P. Guion, 1980) of score validity and the recognition of a unified view that includes social consequences of test interpretation and use. While some aspects of the…

Descriptors: Models, Scores, Standards, Test Interpretation

The Attenuation Paradox of Traditional Test Theory as a Breakdown of Local Independence in Person-Item Response Theory.

Andrich, David – 1984

Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…

Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability

Meaning and Values in Test Validation: The Science and Ethics of Assessment.

Peer reviewed

Messick, Samuel – Educational Researcher, 1989

Presents a unified concept of test validity that integrates both the scientific and ethical considerations of test interpretation and use. Argues that the appropriateness, meaningfulness, and usefulness of score-based inferences are inseparable, and that this integration is based on construct validity. (FMW)

Descriptors: Construct Validity, Ethics, Scores, Social Influences

Influences on and Limitations of Classical Test Theory Reliability Estimates.

Download full text

Arnold, Margery E. – 1996

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…

Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability

Test Theory Reconceived.

Download full text

Mislevy, Robert J. – 1995

Educational test theory consists of statistical and methodological tools to support inferences about examinees' knowledge, skills, and accomplishments. The evolution of test theory has been shaped by the nature of users' inferences which, until recently, have been framed almost exclusively in terms of trait and behavioral psychology. Progress in…

Descriptors: Cognitive Psychology, Developmental Psychology, Educational Testing, Inferences

Ordinal Measurement.

Cliff, Norman – 1984

In almost all applications of measurement there is some sort of response by a human subject. Almost always, the response scale is ordinal, but almost always it is treated as if it were an interval measure. Methods for treating data ordinally are currently being developed in three areas: ordinal analysis for questionnaire responses, ordinal…

Descriptors: Multiple Regression Analysis, Questionnaires, Research Problems, Scores

Between-Groups Experimental Design for Studies of Rights and Formula Scoring.

Angoff, William H. – 1985

This paper points out that there are certain generalizations about directions for guessing and methods of scoring that require that data be derived from random groups design. It supports the viewpoint that it is neither sufficient nor appropriate to make such generalizations on the basis of an analysis of scores obtained from the answer sheets of…

Descriptors: Correlation, Guessing (Tests), Research Design, Scoring Formulas

Evaluation of Scales for Standardized Tests.

Divgi, D. R. – 1980

A method is proposed for providing an absolute, in contrast to comparative, evaluation of how well two tests are equated by transforming their raw scores into a particular common scale. The method is direct, not requiring creation of a standard for comparison; expresses its results in scaled rather than raw scores, and allows examination of the…

Descriptors: Equated Scores, Evaluation Criteria, Item Analysis, Latent Trait Theory

Validity: Whose Construction Is It in the Teaching and Learning Context?

Peer reviewed

Tittle, Carol Kehr – Educational Measurement: Issues and Practice, 1989

An expanded framework for validating tests is needed to include the perspectives of teachers and students as well as of test makers and scientists. The development of educational assessments must take place within an understanding of how tests are used in context. (SLD)

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Utilization, Learning Processes

Wrong Answers on Multiple-Choice Achievement Tests: Blind Guesses or Systematic Choices?.

Powell, J. C. – 1980

A multi-faceted model for the selection of answers for multiple-choice tests was developed from the findings of a series of exploratory studies. This model implies that answer selection should be curvilinear. A series of models were tested for fit using the chi square procedure. Data were collected from 359 elementary school students ages 9-12.…

Descriptors: Elementary Education, Foreign Countries, Goodness of Fit, Guessing (Tests)

The End of an ERA: REQUIEM for the GLH. RIP.

Download full text

Powell, J. C. – 1980

Current Scoring practices for multiple-choice tests are rooted in early Associationist Theory and are based on a two-step procedure: (1) right answers counted as ones and wrong answers are zeros, and (2) number of right answers form a total-correct score. The author contends that if either step is invalid, the use of the general linear model (GLM)…

Descriptors: Elementary Secondary Education, Higher Education, Logical Thinking, Multiple Choice Tests

Basic Concepts in Generalizability Theory: A More Powerful Approach to Evaluating Reliability.

Download full text

Naizer, Gilbert – 1992

A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education

Calling Writers' Bluffs: Sources of Readers' Judgements in University Placement Testing.

Download full text

Sullivan, Francis J. – 1987

To examine "bluffing"--ways in which conflicts in classrooms and evaluation procedures influence the styles of student writing and teachers' responses to different styles, a study analyzed the placement-test essays of 99 undergraduates entering Temple University (Pennsylvania) in the fall of 1982. Analysis of the texts was based on a…

Descriptors: Constructed Response, Essay Tests, Higher Education, Response Style (Tests)

Latent Traits or Latent States? The Role of Discrete Models for Ability and Performance.

Download full text

Haertel, Edward H. – 1992

Classical test theory, item response theory, and generalizability theory all treat the abilities to be measured as continuous variables, and the items of a test as independent probes of underlying continua. These models are well-suited to measuring the broad, diffuse traits of traditional differential psychology, but not for measuring the outcomes…

Descriptors: Ability, Data Analysis, Error of Measurement, Generalizability Theory

Previous Page | Next Page »

Pages: 1 | 2

Test Interpretation	21
Test Theory	21
Elementary Secondary Education	5
Scores	5
Test Reliability	5
Test Validity	5
Higher Education	4
Minimum Competency Testing	4
Test Use	4
Testing Problems	4
Criterion Referenced Tests	3
Educational Testing	3
Generalizability Theory	3
Mathematical Models	3
Multiple Choice Tests	3
Research Methodology	3
Scoring Formulas	3
Standardized Tests	3
Test Construction	3
Test Items	3
Elementary Education	2
Equated Scores	2
Error of Measurement	2
Estimation (Mathematics)	2
Guessing (Tests)	2
More ▼

Powell, J. C.	3
Andrich, David	1
Angoff, William H.	1
Arnold, Margery E.	1
Cliff, Norman	1
Coffman, William E.	1
Divgi, D. R.	1
Goodstein, H. A.	1
Haertel, Edward H.	1
Hanner, Austin	1
Keene, John M.	1
Lyon, Mark A.	1
Messick, Samuel	1
Mislevy, Robert J.	1
Naizer, Gilbert	1
Roth, Rod	1
Smith, Douglas K.	1
Sullivan, Francis J.	1
Tittle, Carol Kehr	1
Woolley, Kristin K.	1
Young, Richard F.	1
More ▼