ERIC - Search Results

Descriptor

Test Theory	21
Testing Problems	21
Test Items	6
Statistical Analysis	5
Test Construction	5
Test Reliability	5
Test Use	5
Test Validity	5
Scoring Formulas	4
Test Interpretation	4
Criterion Referenced Tests	3
Difficulty Level	3
Educational Testing	3
Equated Scores	3
Higher Education	3
Latent Trait Theory	3
Measurement Techniques	3
Reading Tests	3
Scores	3
Cognitive Processes	2
Cognitive Tests	2
Comparative Analysis	2
Diagnostic Tests	2
Elementary Secondary Education	2
Essay Tests	2
More ▼

Source

Executive Review

Publication Type

Speeches/Meeting Papers	21
Reports - Research	12
Opinion Papers	5
Information Analyses	2
Reports - Evaluative	2
Collected Works - Serials	1
Reports - Descriptive	1

Education Level

Audience

Researchers	7
Counselors	1
Practitioners	1

Location

Texas

Laws, Policies, & Programs

Assessments and Surveys

Childrens Depression Inventory	1
Graduate Management Admission…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Thorndike's and Wood's Principles of Educational Measurement: A View from the 1980's.

Download full text

Engelhard, George, Jr. – 1988

The purpose of this essay is to describe the principles of educational measurement proposed by B. Wood during the 1920s in his dissertation, written under the direction of E. L. Thorndike, and later published as "Measurement in Higher Education" (1923). These principles were selected because they illustrate one of the earliest and most complete…

Descriptors: Educational History, Educational Testing, Test Theory, Testing Problems

Basic Concepts in Classical Test Theory: Tests Aren't Reliable, the Nature of Alpha, and Reliability Generalization as a Meta-analytic Method.

Download full text

Helms, LuAnn Sherbeck – 1999

This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…

Descriptors: Effect Size, Meta Analysis, Reliability, Scores

The Attenuation Paradox of Traditional Test Theory as a Breakdown of Local Independence in Person-Item Response Theory.

Andrich, David – 1984

Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…

Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability

Appropriate Quality Assurance Roles for Professional Associations.

Download full text

Fremer, John J. – 1985

The author proposes a greater professional association role in establishing standards for quality assurance in testing. He presents his views as a test developer who dislikes the legal model for resolving professional issues. The use of publications and informational activities to make people aware of the professional standards and how they can be…

Descriptors: Professional Associations, Professional Continuing Education, Quality Control, Standards

Obtaining Some Degree of Correspondence Between Unequatable Scores: A Comparison of Item Response Theory and Equipercentile Equating Methods.

Yen, Wendy M. – 1982

Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…

Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods

Issues in Standard Setting: Some Comments, Some Suggestions, and Maybe Even a Few Answers.

Download full text

Livingston, Samuel A. – 1983

Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…

Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives

Validation of Organizational Communication Audit Instruments.

DeWine, Sue; And Others – 1985

Based on a review of the literature, this paper examines criticisms leveled against the communication audit developed by the International Communication Association (ICA) and then offers a modified version of the audit designed to meet those criticisms. Following a brief introduction, the first section of the paper reviews criticisms of the audit,…

Descriptors: Communication Research, Organizational Communication, Research Methodology, Speech Communication

Domain-Referenced Testing of Reading Achievement.

Brittain, Mary M.; Brittain, Clay V. – 1981

A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…

Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

A Study of Hypotheses Basic to the Use of Rights and Formula Scores. Phase I--Based on Experimental Administration of College Board Tests [and] Phase II--Based on Operational Administration of the GMAT.

Angoff, William H.; Schrader, William B. – 1982

In a study to determine whether a shift from Formula scoring to Rights scoring can be made without causing a discontinuity in the test scale, the analysis of special administrations of the Scholastic Aptitude Test and Chemistry Achievement Test and the variable section of an operational form of the Graduate Management Admission Test (GMAT) is…

Descriptors: Comparative Analysis, Equated Scores, Guessing (Tests), Higher Education

The End of an ERA: REQUIEM for the GLH. RIP.

Download full text

Powell, J. C. – 1980

Current Scoring practices for multiple-choice tests are rooted in early Associationist Theory and are based on a two-step procedure: (1) right answers counted as ones and wrong answers are zeros, and (2) number of right answers form a total-correct score. The author contends that if either step is invalid, the use of the general linear model (GLM)…

Descriptors: Elementary Secondary Education, Higher Education, Logical Thinking, Multiple Choice Tests

An Investigation of Two Procedures for Smoothing Test Norms.

Download full text

Jones, Patricia B.; Sabers, Darrell L. – 1984

Several techniques have been developed for creating continuous smooth distributions of test norms. This paper describes two studies that explore the behavior of cubic splines in order to determine their appropriateness for use in test norming. The first study uses data from the Curriculum Referenced Tests of Mastery (CRTM) and employs two…

Descriptors: Equated Scores, Goodness of Fit, Measurement Techniques, Norm Referenced Tests

Adjusting Scores on Examinations Offering a Choice of Questions.

Download full text

Livingston, Samuel A. – 1986

This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…

Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models

Calling Writers' Bluffs: Sources of Readers' Judgements in University Placement Testing.

Download full text

Sullivan, Francis J. – 1987

To examine "bluffing"--ways in which conflicts in classrooms and evaluation procedures influence the styles of student writing and teachers' responses to different styles, a study analyzed the placement-test essays of 99 undergraduates entering Temple University (Pennsylvania) in the fall of 1982. Analysis of the texts was based on a…

Descriptors: Constructed Response, Essay Tests, Higher Education, Response Style (Tests)

Analysis of Cross-Cultural Attitudinal Scale Translation Using Maximum Likelihood Factor Analysis.

Mayberry, Paul W. – 1984

Efforts to study the fidelity of translation of attitudinal scales into foreign languages have faltered due to the lack of powerful statistical tests to assess such transformations. This study uses a maximum likelihood factor analysis procedure to compare multivariate factor structures across subpopulations. The results showed that inconsistent…

Descriptors: Adults, Attitude Measures, Factor Analysis, Factor Structure

Depression in Children: The Children's Depression Inventory.

Download full text

Crowley, Susan L.; And Others – 1993

Issues surrounding accurate assessment of depression in children have received much attention. However, the stability of scores from depression measures has generally been estimated using only classical test score theory, rather than the more powerful generalizability theory. The dependability of scores from the Children's Depression Inventory…

Descriptors: Children, Clinical Diagnosis, Depression (Psychology), Diagnostic Tests

Previous Page | Next Page »

Pages: 1 | 2

Livingston, Samuel A.	2
Andrich, David	1
Angoff, William H.	1
Armour-Thomas, Eleanor	1
Brittain, Clay V.	1
Brittain, Mary M.	1
Broussard, Rolland L.	1
Coffman, William E.	1
Crowley, Susan L.	1
DeWine, Sue	1
Engelhard, George, Jr.	1
Fremer, John J.	1
Helms, LuAnn Sherbeck	1
Hunt, Earl	1
Jones, Patricia B.	1
Mayberry, Paul W.	1
Powell, J. C.	1
Sabers, Darrell L.	1
Sarvela, Paul D.	1
Schrader, William B.	1
Sullivan, Francis J.	1
Theunissen, Phiel J. J. M.	1
Yen, Wendy M.	1
More ▼