ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Psychometrics	9
Test Format	9
Test Theory	9
Test Items	6
Test Construction	4
Computer Assisted Testing	3
Construct Validity	3
Latent Trait Theory	3
Measurement Techniques	3
Test Validity	3
Adaptive Testing	2
Difficulty Level	2
Foreign Countries	2
Item Analysis	2
Item Banks	2
Multiple Choice Tests	2
Scores	2
Scoring	2
Statistical Analysis	2
Test Interpretation	2
Testing Problems	2
Academic Achievement	1
Access to Education	1
Accountability	1
Achievement Tests	1
More ▼

Source

College Board	1
Educational and Psychological…	1
Journal of Educational…	1
Psychological Assessment	1
Review of Educational Research	1
Review of Research in…	1

Author

Wainer, Howard	2
Brunner, Martin	1
Bruno, James E.	1
Dirkzwager, A.	1
Dorans, Neil J.	1
Engelhard, George, Jr.	1
Houssemand, Claude	1
Kiely, Gerard L.	1
Leary, Linda F.	1
Loarer, Even	1
McBride, James R.	1
Melancon, Janet G.	1
Morgan, Anne	1
Steinmetz, Jean-Paul	1
Thompson, Bruce	1
Wiliam, Dylan	1
Wind, Stefanie A.	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Evaluative	3
Information Analyses	2
Speeches/Meeting Papers	2

Education Level

Elementary Secondary Education	1
High Schools	1
Secondary Education	1

Audience

Location

Luxembourg	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Embedded Figures Test	1
SAT (College Admission Test)	1
Wisconsin Card Sorting Test	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Download full text

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Incomplete Psychometric Equivalence of Scores Obtained on the Manual and the Computer Version of the Wisconsin Card Sorting Test?

Peer reviewed

Direct link

Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010

The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…

Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores

A Review of Estimation Procedures for the Rasch Model with an Eye toward Longish Tests.

Peer reviewed

Morgan, Anne; Wainer, Howard – Journal of Educational Statistics, 1980

Two estimation procedures for the Rasch Model of test analysis are reviewed in detail, particularly with respect to new developments that make the more statistically rigorous conditional maximum likelihood estimation practical for use with longish tests. (Author/JKS)

Descriptors: Error of Measurement, Latent Trait Theory, Maximum Likelihood Statistics, Psychometrics

Determining the Optimal Number of Alternatives to a Multiple-Choice Test Item: An Information Theoretic Perspective.

Peer reviewed

Bruno, James E.; Dirkzwager, A. – Educational and Psychological Measurement, 1995

Determining the optimal number of choices on a multiple-choice test is explored analytically from an information theory perspective. The analysis revealed that, in general, three choices seem optimal. This finding is in agreement with previous statistical and psychometric research. (SLD)

Descriptors: Distractors (Tests), Information Theory, Multiple Choice Tests, Psychometrics

Adaptive Mental Testing: The State of the Art.

Download full text

McBride, James R. – 1979

In an adaptive test, the test administrator chooses test items sequentially during the test, in such a way as to adapt test difficulty to examinee ability as shown during testing. An effectively designed adaptive test can resolve the dilemma inherent in conventional test design. By tailoring tests to individuals, the adaptive test can…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Military Personnel

Implications for Altering the Context in Which Test Items Appear: A Historical Perspective on an Immediate Concern.

Peer reviewed

Leary, Linda F.; Dorans, Neil J. – Review of Educational Research, 1985

Research on the potential effects of different item arrangement schemes on item statistics is reviewed for three separate periods. Earliest studies investigated the simple main effect of item order on test performance. The late 1960s emphasized interactions between item order and examinees' characteristics. Current concern focuses on item…

Descriptors: Achievement Tests, Aptitude Tests, Item Analysis, Latent Trait Theory

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Direct link

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

Measurement Characteristics of the Finding Embedded Figures Test in "Speed" versus "Power" Administrations.

Download full text

Melancon, Janet G.; Thompson, Bruce – 1990

Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…

Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education