ERIC - Search Results

Showing all 9 results

Testing and Data Integrity in the Administration of Statewide Student Assessment Programs

National Council on Measurement in Education, 2012

Testing and data integrity on statewide assessments is defined as the establishment of a comprehensive set of policies and procedures for: (1) the proper preparation of students; (2) the management and administration of the test(s) that will lead to accurate and appropriate reporting of assessment results; and (3) maintaining the security of…

Descriptors: State Programs, Integrity, Testing, Test Preparation

A Psychometric View of Those Who Administer Standardized Tests: Are Test Givers Instruments Too?

Peer reviewed

Carlson, Janet F. – Educational Research Quarterly, 1998

This article invokes a literal image of test givers as measurement devices and explores the psychometric properties of these test administrator instruments. Concurrent and content validation and test-retest and parallel-forms reliability are explored. (SLD)

Descriptors: Achievement Tests, Educational Testing, Examiners, Psychometrics

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

The Complexities of Assessing Teacher Knowledge

Peer reviewed

Schoenfeld, Alan H. – Measurement: Interdisciplinary Research and Perspectives, 2007

The authors of this volume's stimulus papers have taken on the challenge of developing measures of teachers' mathematical knowledge for teaching (MKT). This task involves multiple decisions and considerations, including: (1) How does one specify the body of knowledge being assessed? What warrants are offered for those choices?; (2) How does one…

Descriptors: Test Validity, Psychometrics, Test Construction, Evaluation Research

Innovations: Graphics, Sound, and Alternative Response Modes.

Parshall, Cynthia G.; Stewart, Rob; Ritter, Judy – 1996

While computer-based tests might be as simple as computerized versions of paper-and-pencil examinations, more innovative applications also exist. Examples of innovations in computer-based assessment include the use of graphics or sound, some measure of interactivity, a change in the means by which examinees respond to items, and the application…

Descriptors: College Students, Computer Assisted Testing, Educational Innovation, Graphic Arts

Assessing the Psychometric Quality of Performance Rating Scales: Comparisons among Evaluative Criteria.

Lance, Charles E.; Moomaw, Michael E. – 1983

Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…

Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Hambleton, Ronald K. – 1986

The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…

Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics

Scoring Issues in Selected Statewide Assessment Programs Using Non-Multiple-Choice Formats.

Kahl, Stuart R. – 1995

Although few question the positive impacts alternative forms of assessment can have on instruction, concerns about the psychometric quality of data obtained from such assessments are taking their toll. Scoring issues are at the heart of many of these concerns. This paper addresses the causes of these concerns: misinformation about psychometric…

Descriptors: Alternative Assessment, Educational Assessment, Equated Scores, Performance Based Assessment

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity