Showing all 11 results
Peer reviewed
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for, they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
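The unrecognized design effects Phillips describes can be illustrated with Kish's classic approximation for cluster samples; the cluster size, intraclass correlation, and sample size below are hypothetical values for illustration, not figures from the article.

```python
# Kish's design effect for a cluster sample: DEFF = 1 + (m - 1) * rho,
# where m is the average cluster size and rho the intraclass correlation.
# Treating clustered data as a simple random sample understates sampling
# error because the effective sample size is n / DEFF, not n.

def design_effect(avg_cluster_size: float, icc: float) -> float:
    """Variance inflation relative to simple random sampling."""
    return 1.0 + (avg_cluster_size - 1.0) * icc

def effective_sample_size(n: int, deff: float) -> float:
    """Sample size with equivalent precision under simple random sampling."""
    return n / deff

deff = design_effect(avg_cluster_size=25, icc=0.2)   # 1 + 24 * 0.2 = 5.8
n_eff = effective_sample_size(10_000, deff)          # about 1,724 effective cases
```

With 25 students per sampled school and an intraclass correlation of 0.2, a nominal sample of 10,000 carries the precision of fewer than 2,000 independent observations, which is the kind of underestimated sampling error the abstract warns about.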
Peer reviewed
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed
van Rijn, P. W.; Beguin, A. A.; Verstralen, H. H. F. M. – Assessment in Education: Principles, Policy & Practice, 2012
While measurement precision is relatively easy to establish for single tests and assessments, it is much more difficult to determine for decision making with multiple tests on different subjects. The latter is the situation in the system of final examinations for secondary education in the Netherlands and is used as an example in this paper. This…
Descriptors: Secondary Education, Tests, Foreign Countries, Decision Making
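The gap van Rijn and colleagues point to, between per-test precision and the precision of a decision over several tests, can be illustrated with a small simulation; the conjunctive pass rule, cut score, and error magnitudes below are hypothetical assumptions, not the paper's model.

```python
import random

# Hypothetical setup: a candidate passes overall only if every subject
# score clears the cut (a conjunctive rule). Even with modest per-test
# error (SEM), the overall pass/fail decision can disagree with the
# decision that true scores would give.

random.seed(0)

N_TESTS, CUT, SEM = 4, 50.0, 5.0   # assumed values, for illustration only

def decide(scores):
    """Conjunctive rule: pass only if every test is at or above the cut."""
    return all(s >= CUT for s in scores)

trials = 20_000
correct = 0
for _ in range(trials):
    true = [random.gauss(55.0, 10.0) for _ in range(N_TESTS)]
    observed = [t + random.gauss(0.0, SEM) for t in true]
    correct += decide(true) == decide(observed)

accuracy = correct / trials
print(f"overall decision accuracy: {accuracy:.3f}")
```

The per-test reliability can look acceptable while the compound decision is noticeably less accurate, which is the measurement problem the abstract raises.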
Hills, John R. – 1984
The literature on item bias, i.e., the question of whether some items in tests favor one cultural group over another due to irrelevant factors, is reviewed and evaluated. All known references through 1981 are described, including a large number of unpublished reports. Each method is described and the criticisms that have appeared in…
Descriptors: Evaluation Methods, Item Analysis, Racial Differences, Test Bias
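Among quantitative approaches in the tradition Hills reviews, the Mantel-Haenszel procedure (formalized for item bias after this 1981 review window, by Holland and Thayer) became a standard; a minimal sketch with hypothetical counts, not data from the review:

```python
import math

# Mantel-Haenszel common odds ratio across ability (score) strata.
# Per stratum: A = reference-group correct, B = reference incorrect,
# C = focal-group correct, D = focal incorrect, T = A + B + C + D.
#   alpha_MH = sum_k(A_k * D_k / T_k) / sum_k(B_k * C_k / T_k)
# alpha_MH = 1 indicates no differential item functioning.
# The counts below are invented for illustration.

strata = [
    # (ref_correct, ref_incorrect, focal_correct, focal_incorrect)
    (30, 10, 20, 20),
    (15, 5, 10, 10),
]

num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
alpha_mh = num / den                    # 3.0 here: item favors the reference group
delta_mh = -2.35 * math.log(alpha_mh)   # ETS delta scale; negative favors reference
```

Matching examinees on total score before comparing groups is what distinguishes this family of methods from naive group mean comparisons.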
Miller, M. David; Burstein, Leigh – 1981
Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…
Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models
Klein, Stephen P.; Kosecoff, Jacqueline P. – 1975
A procedure for in-depth analysis of a limited number of tests being considered for selection by school, district, project, or state personnel is described. This procedure involves listing the objectives that it would be desirable to measure, determining the relative importance of each of these objectives, and having "judges" match test items to…
Descriptors: Correlation, Educational Objectives, Evaluation, Evaluation Criteria
Peer reviewed
Bhaskar, R.; Dillard, Jesse F. – Instructional Science, 1983
Description of an objective method for assigning weights to questions on examinations includes discussions of classical test theory, knowledge organization, and how task analysis can be used to identify knowledge elements required to solve specific problems, rank them, and assign objective weights to exam questions using a Pareto distribution (7…
Descriptors: Accounting, Epistemology, Evaluation Methods, Item Analysis
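Bhaskar and Dillard's use of a Pareto distribution to weight exam questions can be read, in sketch form, as power-law weighting by rank; the function name, the exponent, and the point total below are assumptions for illustration, not the authors' actual procedure.

```python
# Hedged sketch: rank the knowledge elements a question requires
# (1 = most important), give rank k a raw Pareto-like weight k**(-alpha),
# then normalize so the weights sum to the total exam points.

def pareto_weights(n_items: int, alpha: float = 1.0,
                   total_points: float = 100.0) -> list[float]:
    """Power-law weights by rank, normalized to total_points."""
    raw = [k ** -alpha for k in range(1, n_items + 1)]
    scale = total_points / sum(raw)
    return [scale * w for w in raw]

weights = pareto_weights(4, alpha=1.0)
# Highest-ranked element gets the largest share; weights sum to 100.
```

The appeal of such a scheme is that it replaces ad hoc point assignments with a single defensible parameter (the exponent), which is the kind of objectivity the abstract describes.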
van der Linden, Wim J. – Evaluation in Education: International Progress, 1982
Instructional programs organized according to modern educational technology are discussed in terms of the purposes for which criterion-referenced measurements are used. The problems of criterion-referenced measurement include scoring and score interpretation, item and test analysis, and mastery testing. An overview of solutions and approaches to the problems and…
Descriptors: Criterion Referenced Tests, Educational Testing, Evaluation Methods, Item Analysis
Constable, Elizabeth; Andrich, David – 1984
In circumstances where judges are required to make ratings of performance, it is usually considered necessary to have two or more raters who are trained to agree on independent ratings of the same performance. It is suggested that such a requirement may produce the paradox of attenuation associated with item analysis, in which too high a correlation between…
Descriptors: Elementary Secondary Education, Evaluation Methods, Interrater Reliability, Interviews
Hambleton, Ronald K.; Rogers, H. Jane – 1988
Issues in preparing a review form to detect item bias in tests are discussed and the first draft of an item bias review form is presented. While stereotyping is the consistent representation of a given group in a particular light, bias is the presence of some characteristic of an item that results in differential performance of two individuals of…
Descriptors: Content Analysis, Culture Fair Tests, Ethnic Stereotypes, Evaluation Methods
Jaeger, Richard M.; Busch, John Christian – 1986
This study explores the use of the modified caution index (MCI) for identifying judges whose patterns of recommendations suggest that their judgments might be based on incomplete information, flawed reasoning, or inattention to their standard-setting tasks. It also examines the effect on test standards and passing rates when the test standards of…
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, High Schools
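The modified caution index Jaeger and Busch apply to judges is a person-fit statistic due to Harnisch and Linn; below is a minimal sketch of one common formulation for dichotomous data. The study applies it to standard-setting recommendations rather than item responses, so its exact computation may differ.

```python
# Modified caution index (MCI) for a 0/1 response vector u and item
# difficulties p (proportion correct), with r = number of 1s in u:
#   MCI = (sum of p over the r easiest items - sum_j u_j * p_j)
#         / (sum of p over the r easiest items - sum of p over the r hardest)
# MCI = 0 for a perfect Guttman pattern (only the easiest items endorsed),
# MCI = 1 for a fully reversed one; high values flag aberrant patterns.

def modified_caution_index(u: list[int], p: list[float]) -> float:
    r = sum(u)
    if r == 0 or r == len(u):
        return 0.0  # all-0 or all-1 patterns are trivially Guttman-consistent
    order = sorted(range(len(p)), key=lambda j: p[j], reverse=True)  # easiest first
    easiest = sum(p[j] for j in order[:r])
    hardest = sum(p[j] for j in order[-r:])
    observed = sum(uj * pj for uj, pj in zip(u, p))
    return (easiest - observed) / (easiest - hardest)
```

In the standard-setting context, a judge whose item-by-item recommendations run against the items' overall ordering would earn a high MCI, flagging the incomplete information or inattention the abstract mentions.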