NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 20 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Penfield, Randall D. – Educational Researcher, 2010
A growing body of research showing that grade retention serves as an educationally low-quality placement has raised increasing concerns about whether the use of standardized tests in making decisions concerning grade retention conforms to current standards for appropriate and nondiscriminatory test use. This article examines the extent to which…
Descriptors: Test Use, Grade Repetition, Standardized Tests, Learning Readiness
Peer reviewed Peer reviewed
Direct linkDirect link
Kane, Michael T. – Educational Researcher, 2008
Lissitz and Samuelsen (2007) have proposed an operational definition of "validity" that shifts many of the questions traditionally considered under validity to a separate category associated with the utility of test use. Operational definitions support inferences about how well people perform some kind of task or how they respond to some kind of…
Descriptors: Test Use, Definitions, Validity, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Peer reviewed Peer reviewed
Thompson, Bruce – Educational Researcher, 1996
Reviews practices regarding tests of statistical significance and policies of the American Educational Research Association (AERA). Decades of misuse of statistical significance testing are described, and revised editorial policies to improve practice are highlighted. Correct interpretation of statistical tests, interpretation of effect sizes, and…
Descriptors: Editing, Educational Research, Effect Size, Statistical Significance
Peer reviewed Peer reviewed
Messick, Samuel – Educational Researcher, 1981
Argues for appraising tests for evidence of construct validity as well as evaluating the potential social consequences of test use. Asserts that construct validity provides a rational approach for predictive hypotheses and a rational basis for judgment of test relevance to the criterion domain. (Author/JCD)
Descriptors: Ethics, Evaluation Criteria, Scores, Test Construction
Peer reviewed Peer reviewed
Thompson, Bruce – Educational Researcher, 1997
Argues that describing results as "significant" rather than "statistically significant" is confusing to the very people most apt to misinterpret this telegraphic wording. The importance of reporting the effect size and the value of both internal and external replicability analyses are stressed. (SLD)
Descriptors: Editing, Educational Research, Effect Size, Scholarly Journals
Peer reviewed Peer reviewed
Messick, Samuel – Educational Researcher, 1989
Presents a unified concept of test validity that integrates both the scientific and ethical considerations of test interpretation and use. Argues that the appropriateness, meaningfulness, and usefulness of score-based inferences are inseparable, and that this integration is based on construct validity. (FMW)
Descriptors: Construct Validity, Ethics, Scores, Social Influences
Peer reviewed Peer reviewed
Sternberg, Robert J. – Educational Researcher, 1998
Links the literatures on human abilities and expertise, suggesting that human abilities are a form of developing expertise. Discusses the role of tests in a scheme that regards abilities as developing expertise and presents a model that implies a shift toward practice grounded in the development of knowledge-based expertise in all children.…
Descriptors: Ability, Children, Educational Assessment, Elementary Secondary Education
Peer reviewed Peer reviewed
Brookhart, Susan M. – Educational Researcher, 1999
Comments on the discussion by Ginette Delandshere and Anthony Petrosky of the numerical rating of complex performances in the context of the National Board for Professional Teaching Standards Early Adolescence/English Language Arts assessment. Conceptualizes the discussion in terms of matching information to purpose in testing. (SLD)
Descriptors: Adolescents, Elementary Secondary Education, Language Arts, Measurement Techniques
Peer reviewed Peer reviewed
Delandshere, Ginette; Petrosky, Anthony R. – Educational Researcher, 1999
Responds to Susan Brookhart's comments on the discussion of assessing complex performances by Delandshere and Petrosky. Notes that the article focused on the meaning and usefulness of scores or ratings in the context of the National Board for Professional Teaching Standards assessments. (SLD)
Descriptors: Adolescents, Elementary Secondary Education, Language Arts, Measurement Techniques
Peer reviewed Peer reviewed
Robinson, Daniel H.; Levin, Joel R. – Educational Researcher, 1997
Proposes modifications to the recent suggestions by B. Thompson (1996) for an American Educational Research Association editorial policy on statistical significance testing. Points out that, although it is useful to include effect sizes, they can be misinterpreted, and argues, as does Thompson, for greater attention to replication in educational…
Descriptors: Editing, Educational Research, Effect Size, Research Methodology
Peer reviewed Peer reviewed
Snow, Richard E. – Educational Researcher, 1989
Reviews new conceptions of cognitive and conative aptitude, learning, development, and achievement and their assessment. Argues that different purposes for educational assessment require different levels and models of assessment. Strongly suggests research on construct validity and teacher understanding and use of assessment. (FMW)
Descriptors: Academic Achievement, Academic Aptitude, Cognitive Structures, Construct Validity
Peer reviewed Peer reviewed
Delandshere, Ginette; Petrosky, Anthony R. – Educational Researcher, 1994
Discusses the role and consistency of judges' interpretations of teacher performance as part of an evaluative scheme for complex performance, with reference to the ideological framework of professional standards. The tension between assessment decisions and the recognition that assessment involves interpretation is explored. (SLD)
Descriptors: Decision Making, Educational Assessment, Epistemology, Evaluators
Peer reviewed Peer reviewed
Howe, Kenneth R. – Educational Researcher, 1994
Discusses the problem of educational testing that results in differential educational opportunities for the disadvantaged. It critically examines the proposition that there may be ways to justify the decisions made on the basis of differential test performance that are consistent with the requirements of equality. (GLR)
Descriptors: Criticism, Decision Making, Economically Disadvantaged, Educational Assessment
Peer reviewed Peer reviewed
Sternberg, Robert J. – Educational Researcher, 1996
Ten myths and countermyths about intelligence are explored. It is argued that the desire for simplicity and publicity has led psychologists and others writing about intelligence to take positions that cannot be justified by current theory or recent data. However defined, intelligence is but one aspect of being human. (SLD)
Descriptors: Biological Influences, Environmental Influences, Ethnicity, Genetics
Previous Page | Next Page ยป
Pages: 1  |  2