ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Test Theory	15
Testing Problems	15
Test Use	9
Educational Testing	7
Psychometrics	7
Test Validity	7
Test Interpretation	6
Test Reliability	6
Comparative Analysis	5
Evaluation Methods	5
Measurement Techniques	5
Test Construction	5
Criterion Referenced Tests	4
Definitions	4
Educational Assessment	4
Equated Scores	4
Foreign Countries	4
High Stakes Tests	4
Predictive Measurement	4
Classification	3
Measurement Objectives	3
Predictive Validity	3
Standards	3
Test Items	3
Achievement Tests	2
More ▼

Source

Measurement:…	4
Educational Measurement:…	1
Executive Review	1
Journal of Educational…	1
Journal of Experimental…	1
Performance and Instruction	1

Author

Baird, Jo-Anne	1
Brittain, Clay V.	1
Brittain, Mary M.	1
Coffman, William E.	1
Cresswell, Mike	1
Fremer, John J.	1
Hunt, Earl	1
Linn, Robert L.	1
Livingston, Samuel A.	1
Myers, Charles T.	1
Newton, Paul E.	1
Secolsky, Charles	1
Shrock, Sharon	1
Wadleigh, Sandra L.	1
Williams, Richard H.	1
Zimmerman, Donald W.	1
von Davier, Alina A.	1
More ▼

Publication Type

Opinion Papers	15
Journal Articles	8
Speeches/Meeting Papers	5
Collected Works - Serials	1
Information Analyses	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Elementary Secondary Education

Audience

Researchers

Location

United Kingdom	2
United Kingdom (England)	2
United States	2
Australia	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Nelson Denny Reading Tests	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Defending the Quality of Links between Scores from Different Tests and Exams

Peer reviewed

Direct link

Cresswell, Mike – Measurement: Interdisciplinary Research and Perspectives, 2010

Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

Conceptualizing Comparability

Peer reviewed

Direct link

Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2010

This article presents the author's rejoinder to thinking about linking from issue 8(1). Particularly within the more embracing linking frameworks, e.g., Holland & Dorans (2006) and Holland (2007), there appears to be a major disjunction between (1) classification discourse: the supposed basis for classification, that is, the underlying theory…

Descriptors: Foreign Countries, Measurement Techniques, Psychometrics, Comparative Analysis

What Constitutes Legitimate Causal Linking?

Peer reviewed

Direct link

Baird, Jo-Anne – Measurement: Interdisciplinary Research and Perspectives, 2010

Newton's article (2010) makes three main contributions to the literature. First, it is transatlantic, bringing together literatures that have been dealing with similar problems, using sometimes different methods and certainly with distinctive educational, cultural perspectives. He points out that neither of these literatures has all of the…

Descriptors: Foreign Countries, Predictive Validity, Standards, Ethics

What Dictates the Meaning of Test Linking? A Reaction to "Thinking about Linking"

Peer reviewed

Direct link

von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010

The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…

Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria

On the Direct Measurement of Face Validity: A Comment on Nevo.

Peer reviewed

Secolsky, Charles – Journal of Educational Measurement, 1987

For measuring the face validity of a test, Nevo suggested that test takers and nonprofessional users rate items on a five point scale. This article questions the ability of those raters and the credibility of the aggregated judgment as evidence of the validity of the test. (JAZ)

Descriptors: Content Validity, Measurement Techniques, Rating Scales, Test Items

Error of Measurement and Statistical Inference: Some Anomalies.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980

It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)

Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

Appropriate Quality Assurance Roles for Professional Associations.

Download full text

Fremer, John J. – 1985

The author proposes a greater professional association role in establishing standards for quality assurance in testing. He presents his views as a test developer who dislikes the legal model for resolving professional issues. The use of publications and informational activities to make people aware of the professional standards and how they can be…

Descriptors: Professional Associations, Professional Continuing Education, Quality Control, Standards

Issues in Standard Setting: Some Comments, Some Suggestions, and Maybe Even a Few Answers.

Download full text

Livingston, Samuel A. – 1983

Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…

Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives

Domain-Referenced Testing of Reading Achievement.

Brittain, Mary M.; Brittain, Clay V. – 1981

A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…

Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Two Weak Spots in the Practice of Criterion-referenced Measurement.

Peer reviewed

Linn, Robert L. – Educational Measurement: Issues and Practice, 1982

Confusion in the terminology used in criterion-referenced measurement specifications and development and standard setting and the attendant role of cut-off scores are shown to need practical clarification through psychometric research on test applications and consequences. (CM)

Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Measurement Objectives

An Overview of Criterion-Referenced Test Development.

Shrock, Sharon; And Others – Performance and Instruction, 1986

Presents major stages in design and development of criterion referenced tests (CRT) with emphasis on differences between CRT construction and norm-referenced test construction. Discussion covers test interpretation; test theory; preparation for test construction (hierarchical analysis, item type selection, and choosing number of items); test…

Descriptors: Adoption (Ideas), Comparative Analysis, Criterion Referenced Tests, Industrial Training

Testing and the Curriculum: Proceed with Caution.

Download full text

Wadleigh, Sandra L.; And Others – 1993

A study compared the performance of 44 applicants seeking admission to an alternative high school (n=19) and nursing assistant program (n=25) at a Wisconsin postsecondary institution on the Assessment of Student Skills for Entry Transfer (ASSET) test and the Nelson-Denny Reading Test. (Applicants who did not achieve a minimum score on ASSET then…

Descriptors: Allied Health Occupations Education, Educational Testing, High Schools, Nontraditional Education

Science, Technology, and Intelligence, Technical Report 9.

Download full text

Hunt, Earl – 1985

The scientific concept of intelligence has been heavily influenced by the technology of measurement. The variables which can be measured have been made the operational definition of intelligence. This approach differs from a deductive approach, in which a theory of cognition in general is used to derive the sorts of measurements that must be taken…

Descriptors: Cognitive Measurement, Cognitive Processes, Cognitive Tests, Individual Differences

Those Achievement Tests--How Useful?

Coffman, William E. – Executive Review, 1980

Standardized achievement tests are often misused as indicators of a school's quality or effectiveness relative to other schools. This is an incorrect use because it ignores variation among schools in student abilities, family support of education, student mobility, and other factors. People also misuse tests because they impute to them more…

Descriptors: Academic Ability, Achievement Tests, Criterion Referenced Tests, Educational Testing