ERIC - Search Results

Descriptor

Item Analysis	25
Test Reliability	25
Test Validity	16
Test Construction	12
Test Items	11
Higher Education	10
Latent Trait Theory	9
Difficulty Level	5
Multiple Choice Tests	5
Achievement Tests	4
Computer Assisted Testing	4
Elementary Secondary Education	4
Item Banks	4
Mathematical Models	4
Scores	4
Standardized Tests	4
Student Attitudes	4
Test Format	4
Analysis of Variance	3
Concurrent Validity	3
Construct Validity	3
Educational Research	3
Factor Analysis	3
Mastery Tests	3
Mathematics Tests	3
More ▼

Source

American Journal on Mental…	1
Diagnostique	1
Educational Measurement:…	1
Journal for Research in…	1
Research in Higher Education	1

Publication Type

Reports - Research	20
Speeches/Meeting Papers	15
Journal Articles	5
Information Analyses	2
Opinion Papers	2
Reports - Evaluative	2
Tests/Questionnaires	2
Guides - Non-Classroom	1
Reports - Descriptive	1

Education Level

Audience

Researchers	25
Practitioners	2
Teachers	2

Location

India	1
Nigeria	1
Turkey	1
Virgin Islands	1

Laws, Policies, & Programs

Assessments and Surveys

Bayley Scales of Infant…	1
California Critical Thinking…	1
Cattell Culture Fair…	1
SAT (College Admission Test)	1
Stanford Achievement Tests	1
Test Anxiety Inventory	1
Test of Economic Literacy	1
Test of Understanding in…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

On the Study of Matching Cut-Scores to Test Characteristics: An Observed Score Approach. Program Statistics Research Technical Report Series.

Wainer, Howard – 1985

Techniques derived from item response theory are useful for estimating the reliability of test classification above and below the cutting score. Test developers can construct a test whose information is peaked in the region of the cutting score; users can select a test which provides the most information in this region. The Cut-Score…

Descriptors: Cutting Scores, Item Analysis, Latent Trait Theory, Mastery Tests

Attitude Assessment for Research in Economic Education.

Hodgin, Robert F. – 1984

Guidelines for the construction and use of an attitude instrument are presented, and the application of the instrument to measure student attitude toward economics is described. Attention is directed to the Likert-like summated forced-choice variety of attitude instrument, whereby attitude toward the object is inferred from the summed responses to…

Descriptors: Attitude Measures, Economics Education, Higher Education, Item Analysis

Basic Skills Achievement in the Caribbean: A Research Model.

Download full text

Bliss, Leonard B. – 1984

A model for the validation of standardized tests of academic achievement upon populations not represented in the samples used to standardize the tests is presented, and the results of a field testing of the model are described. The 1973 editions of the Stanford Achievement Test and the Test of Academic Skills were administered to a sample of…

Descriptors: Achievement Tests, Basic Skills, Elementary Secondary Education, Item Analysis

Measuring a van Hiele Geometry Sequence: A Reanalysis.

Peer reviewed

Wilson, Mark – Journal for Research in Mathematics Education, 1990

Summarizes a reanalysis of the data from an investigation of a test designed to measure a learning sequence in geometry based on the work of van Hiele (1986). Discusses the test based on the Rasch model. (YP)

Descriptors: Geometric Concepts, Geometry, Item Analysis, Mathematical Concepts

The Stereotyped Behavior Scale for Adolescents and Adults with Mental Retardation.

Rojahn, Johannes; Tasse, Marc J.; Sturmey, Peter – American Journal on Mental Retardation, 1997

Development of the Stereotyped Behavior Scale for adolescents and adults with mental retardation is described. Use with 600 individuals resulted in refinement and a 26-item scale with an internal consistency alpha of 0.88, test-retest reliability of p=0.90, and interrater reliability of p=0.76. (DB)

Descriptors: Adolescents, Adults, Behavior Patterns, Behavior Rating Scales

Technical Characteristics and Some Correlates of the California Critical Thinking Skills Test, Forms A and B.

Peer reviewed

Jacobs, Stanley S. – Research in Higher Education, 1995

Comparison of college freshman performance on two different forms of the California Critical Thinking Skills Test (n=684, 692) found a lack of equivalence between forms and low internal consistency reliability. It is suggested that, although the test may be useful for research, it is not appropriate for decision making about individual students.…

Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Educational Research

A Quantitative Review of Research on Multiple-Choice Item Writing.

Download full text

Haladyna, Thomas M.; Downing, Steven M. – 1985

In this paper 45 item-writing rules for multiple-choice tests presented in textbooks on educational measurement in a previous study are identified. The current study presents a quantitative review of the literature with respect to the empirical and theoretical evaluation of these principles of item-writing. Fifty-six studies that addressed at…

Descriptors: Educational Research, Elementary Secondary Education, Item Analysis, Multiple Choice Tests

Reliability and Validity of Two-Option Multiple-Choice and Comparably Written True-False Items.

Download full text

Sax, Gilbert; Reiter, Pauline B. – 1980

Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…

Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis

Rasch Analysis of the Standardization Data of the Bayley Mental Scale of Infant Development.

Snyder, Scott; Sheehan, Robert – Diagnostique, 1992

Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)

Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity

Using Microcomputers to Assess Achievement and Instruction.

Peer reviewed

Nelson, Larry R. – Educational Measurement: Issues and Practice, 1984

The author argues that scoring, reporting, and deriving final grades can be considerably assisted by using a computer. He also contends that the savings in time and the computer database formed will allow instructors to determine test quality and reflect on the quality of instruction. (BW)

Descriptors: Achievement Tests, Affective Objectives, Computer Assisted Testing, Educational Testing

The Development of a Computer Literacy Assessment Instrument.

Torardi, Mary Montag – 1985

This report describes the procedures used to construct and validate the Standardized Test of Computer Literacy (STCL), a criterion-referenced instrument designed to assess students' computer literacy. Following a statement of the study problem and purpose, a description of the study methodology outlines the use of a 12-step model for development…

Descriptors: Academic Achievement, Competence, Computer Literacy, Computer Science Education

An Investigation of the Relationship between Item Arrangement and Test Performance.

Chissom, Brad; Chukabarah, Prince C. O. – 1985

The comparative effects of various sequences of test items were examined for over 900 graduate students enrolled in an educational research course at The University of Alabama, Tuscaloosa. experiment, which was conducted a total of four times using four separate tests, presented three different arrangements of 50 multiple-choice items: (1)…

Descriptors: Analysis of Variance, Comparative Testing, Difficulty Level, Graduate Students

The Effect of Keying All Options Correct on Equating Functions and Scores.

Download full text

Lenel, Julia C.; Gilmer, Jerry S. – 1986

In some testing programs an early item analysis is performed before final scoring in order to validate the intended keys. As a result, some items which are flawed and do not discriminate well may be keyed so as to give credit to examinees no matter which answer was chosen. This is referred to as allkeying. This research examined how varying the…

Descriptors: Equated Scores, Item Analysis, Latent Trait Theory, Licensing Examinations (Professions)

Riding the Rasch Tiger. Part 1: Laying the Item Bank Foundation (Paul Volker Would Approve).

Forster, Fred – 1987

Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…

Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis

Distributional Projections: A Practical Application of the Rasch Model.

Download full text

Phillips, Gary W.; Huynh, Huynh – 1985

A procedure which may be used to project the frequency distribution of one test onto that of another test is described and illustrated. The procedure is useful when a test developer wishes to construct an alternate form with preferred distributional characteristics. For example, the test developer may wish to construct a new test form with a…

Descriptors: Achievement Tests, Elementary Secondary Education, Item Analysis, Item Banks

Previous Page | Next Page »

Pages: 1 | 2

Bliss, Leonard B.	1
Chissom, Brad	1
Chukabarah, Prince C. O.	1
Cook, Linda L.	1
Downing, Steven M.	1
Forster, Fred	1
Gilmer, Jerry S.	1
Haladyna, Thomas M.	1
Hampilos, John P.	1
Harnisch, Delwyn L.	1
Hodgin, Robert F.	1
Huynh, Huynh	1
Jacobs, Stanley S.	1
Lenel, Julia C.	1
Mattheis, Floyd E.	1
Mueller, Richard J.	1
Nakayama, Genzo	1
Nelson, Larry R.	1
Nenty, H. Johnson	1
O'Brien, Michael	1
Oner, Necla	1
Phillips, Gary W.	1
Reiter, Pauline B.	1
Rojahn, Johannes	1
More ▼