NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 21 results Save | Export
Engelhard, George, Jr. – 1988
The purpose of this essay is to describe the principles of educational measurement proposed by B. Wood during the 1920s in his dissertation, written under the direction of E. L. Thorndike, and later published as "Measurement in Higher Education" (1923). These principles were selected because they illustrate one of the earliest and most complete…
Descriptors: Educational History, Educational Testing, Test Theory, Testing Problems
Helms, LuAnn Sherbeck – 1999
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Descriptors: Effect Size, Meta Analysis, Reliability, Scores
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability
Fremer, John J. – 1985
The author proposes a greater professional association role in establishing standards for quality assurance in testing. He presents his views as a test developer who dislikes the legal model for resolving professional issues. The use of publications and informational activities to make people aware of the professional standards and how they can be…
Descriptors: Professional Associations, Professional Continuing Education, Quality Control, Standards
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
Livingston, Samuel A. – 1983
Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…
Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives
DeWine, Sue; And Others – 1985
Based on a review of the literature, this paper examines criticisms leveled against the communication audit developed by the International Communication Association (ICA) and then offers a modified version of the audit designed to meet those criticisms. Following a brief introduction, the first section of the paper reviews criticisms of the audit,…
Descriptors: Communication Research, Organizational Communication, Research Methodology, Speech Communication
Brittain, Mary M.; Brittain, Clay V. – 1981
A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…
Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction
Angoff, William H.; Schrader, William B. – 1982
In a study to determine whether a shift from Formula scoring to Rights scoring can be made without causing a discontinuity in the test scale, the analysis of special administrations of the Scholastic Aptitude Test and Chemistry Achievement Test and the variable section of an operational form of the Graduate Management Admission Test (GMAT) is…
Descriptors: Comparative Analysis, Equated Scores, Guessing (Tests), Higher Education
Powell, J. C. – 1980
Current Scoring practices for multiple-choice tests are rooted in early Associationist Theory and are based on a two-step procedure: (1) right answers counted as ones and wrong answers are zeros, and (2) number of right answers form a total-correct score. The author contends that if either step is invalid, the use of the general linear model (GLM)…
Descriptors: Elementary Secondary Education, Higher Education, Logical Thinking, Multiple Choice Tests
Jones, Patricia B.; Sabers, Darrell L. – 1984
Several techniques have been developed for creating continuous smooth distributions of test norms. This paper describes two studies that explore the behavior of cubic splines in order to determine their appropriateness for use in test norming. The first study uses data from the Curriculum Referenced Tests of Mastery (CRTM) and employs two…
Descriptors: Equated Scores, Goodness of Fit, Measurement Techniques, Norm Referenced Tests
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Sullivan, Francis J. – 1987
To examine "bluffing"--ways in which conflicts in classrooms and evaluation procedures influence the styles of student writing and teachers' responses to different styles, a study analyzed the placement-test essays of 99 undergraduates entering Temple University (Pennsylvania) in the fall of 1982. Analysis of the texts was based on a…
Descriptors: Constructed Response, Essay Tests, Higher Education, Response Style (Tests)
Mayberry, Paul W. – 1984
Efforts to study the fidelity of translation of attitudinal scales into foreign languages have faltered due to the lack of powerful statistical tests to assess such transformations. This study uses a maximum likelihood factor analysis procedure to compare multivariate factor structures across subpopulations. The results showed that inconsistent…
Descriptors: Adults, Attitude Measures, Factor Analysis, Factor Structure
Crowley, Susan L.; And Others – 1993
Issues surrounding accurate assessment of depression in children have received much attention. However, the stability of scores from depression measures has generally been estimated using only classical test score theory, rather than the more powerful generalizability theory. The dependability of scores from the Children's Depression Inventory…
Descriptors: Children, Clinical Diagnosis, Depression (Psychology), Diagnostic Tests
Previous Page | Next Page ยป
Pages: 1  |  2