ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Test Format	11
Test Theory	11
Test Construction	6
Test Items	4
Equated Scores	3
Evaluation Methods	3
Foreign Countries	3
Psychometrics	3
Statistical Analysis	3
Adaptive Testing	2
Computer Assisted Testing	2
Error of Measurement	2
Factor Analysis	2
Multiple Choice Tests	2
Norm Referenced Tests	2
Scores	2
Student Evaluation	2
Test Bias	2
Test Interpretation	2
Test Validity	2
Testing	2
Academic Achievement	1
Access to Education	1
Accountability	1
Achievement Tests	1
More ▼

Source

Applied Measurement in…	1
Applied Psychological…	1
Educational and Psychological…	1
Journal of Educational and…	1
Psychological Assessment	1
Review of Research in…	1
Spectrum	1

Author

Berger, Martijn P. F.	1
Brunner, Martin	1
Bruno, James E.	1
Dirkzwager, A.	1
Downing, Steven M.	1
Haladyna, Thomas M.	1
Houssemand, Claude	1
Jolly, S. Jean	1
Little, Roderick J. A.	1
Loarer, Even	1
Murray, Joel R.	1
Rubin, Donald B.	1
Steinmetz, Jean-Paul	1
Stewart, E. Elizabeth	1
Veerkamp, Wim J. J.	1
White, Karl	1
Wiliam, Dylan	1
van der Linden, Wim J.	1
More ▼

Publication Type

Reports - Evaluative	11
Journal Articles	7
Information Analyses	1
Opinion Papers	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education

Audience

Location

Luxembourg	1
United Kingdom	1
United States	1
Utah	1

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Armed Services Vocational…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Wisconsin Card Sorting Test	1

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Incomplete Psychometric Equivalence of Scores Obtained on the Manual and the Computer Version of the Wisconsin Card Sorting Test?

Peer reviewed

Direct link

Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010

The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…

Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores

A Review of Selection Methods for Optimal Test Design. Research Report 94-4.

Download full text

Berger, Martijn P. F.; Veerkamp, Wim J. J. – 1994

The designing of tests has been a source of concern for test developers over the past decade. Various kinds of test forms have been applied. Among these are the fixed-form test, the adaptive test, and the testlet. Each of these forms has its own design. In this paper, the construction of test forms is placed within the general framework of optimal…

Descriptors: Adaptive Testing, Foreign Countries, Research Design, Selection

Test Equating from Biased Samples, with Application to the Armed Services Vocational Aptitude Battery.

Peer reviewed

Little, Roderick J. A.; Rubin, Donald B. – Journal of Educational and Behavioral Statistics, 1994

Equating a new standard test to an old reference test is considered when samples for equating are not randomly selected from the target population of test takers, identifying two problems from equating from biased samples. An empirical example with data from the Armed Services Vocational Aptitude Battery illustrates the approach. (SLD)

Descriptors: Equated Scores, Military Personnel, Sampling, Statistical Analysis

A Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)

Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests

Determining the Optimal Number of Alternatives to a Multiple-Choice Test Item: An Information Theoretic Perspective.

Peer reviewed

Bruno, James E.; Dirkzwager, A. – Educational and Psychological Measurement, 1995

Determining the optimal number of choices on a multiple-choice test is explored analytically from an information theory perspective. The analysis revealed that, in general, three choices seem optimal. This finding is in agreement with previous statistical and psychometric research. (SLD)

Descriptors: Distractors (Tests), Information Theory, Multiple Choice Tests, Psychometrics

Equating Scores from Adaptive to Linear Tests

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores

Methodological Issues Related to the Study of Context Effects in Multisection Tests.

Stewart, E. Elizabeth – 1981

Context effects are defined as being influences on test performance associated with the content of successively presented test items or sections. Four types of context effects are identified: (1) direct context effects (practice effects) which occur when performance on items is affected by the examinee having been exposed to similar types of…

Descriptors: Context Effect, Data Collection, Error of Measurement, Evaluation Methods

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

State Refinements to the ESEA Title I Evaluation and Reporting System: Utah 1979-80 Project. Final Report.

Download full text

White, Karl; And Others – 1981

To explain discrepancies in Utah's elementary school test results under the Elementary and Secondary Education Act's Title I Evaluation and Reporting System (TIERS), researchers investigated the adequacy and validity of TIERS evaluation models. Model A (norm-referenced testing) is used in most Utah school districts, in preference to Models B or C…

Descriptors: Achievement Tests, Elementary Education, Evaluation Methods, Norm Referenced Tests

Customizing Achievement Tests to Fit Local District Objectives.

Jolly, S. Jean – Spectrum, 1983

Proposes that objective-referenced tests replace norm-referenced tests as a vehicle for program evaluation. Describes a methodology, based on latent trait theory, for joining norm-referenced and objective-referenced testing in a customized testing program. (TE)

Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Latent Trait Theory, Measurement Objectives

Steps and Recommendations for More Accurate Placement Test Creation.

Download full text

Murray, Joel R. – 2001

This paper aims to provide practical advice for creating a placement test for English-as-a-Second-Language (ESL) or English-as-a-foreign-language (EFL) instruction. Three forms of concrete assistance are provided: a detailed literature review; detailed steps focusing on the creation of placement tests; and a set of recommendations focusing on…

Descriptors: English (Second Language), Examiners, Factor Analysis, Literature Reviews