Showing 256 to 270 of 418 results
Peer reviewed
Barnes, Janet L.; Landy, Frank J. – Applied Psychological Measurement, 1979
Although behaviorally anchored rating scales have both intuitive and empirical appeal, they have not always yielded results superior to those of graphic rating scales. Results indicate that the choice of an anchoring procedure will depend on the nature of the actual rating process. (Author/JKS)
Descriptors: Behavior Rating Scales, Comparative Testing, Higher Education, Rating Scales
Peer reviewed
Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991
Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)
Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)
Peer reviewed
Charak, David A.; Stella, Jennifer L. – Assessment for Effective Intervention, 2002
This article provides in-depth information regarding the most commonly used instruments for the screening or diagnosis of autistic spectrum disorders. Reliability, validity, format, and target population are presented to help clinicians select appropriate diagnostic measures. Future directions in the development of new instruments are discussed.
Descriptors: Adolescents, Adults, Autism, Children
Peer reviewed
Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007
The purpose of the study was to determine the comparability of an online version with the original paper-and-pencil version of the Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory
Henning, Grant – 1991
In order to evaluate the Test of English as a Foreign Language (TOEFL) vocabulary item format and to determine the effectiveness of alternative vocabulary test items, this study investigated the functioning of eight different multiple-choice formats that differed with regard to: (1) length and inference-generating quality of the stem; (2) the…
Descriptors: Adults, Context Effect, Difficulty Level, English (Second Language)
Ohio State Univ., Columbus. Trade and Industrial Education Instructional Materials Lab. – 1978
The Ohio Vocational Achievement Tests are specially designed instruments for use by teachers, supervisors, and administrators to evaluate and diagnose vocational achievement for improving instruction in secondary vocational programs at the 11th and 12th grade levels. This guide explains the Ohio Vocational Achievement Tests and how they are used.…
Descriptors: Academic Achievement, Achievement Tests, High Schools, Scoring Formulas
Melancon, Janet G.; Thompson, Bruce – 1988
Applied classical measurement theory was used to study the measurement characteristics of Forms A and B of the Finding Embedded Figures Test (FEFT) when the test is administered in a "no-guessing" or "supply" format. Data provided by 69 students at a private university in the southern United States were used. Both forms of the…
Descriptors: Comparative Analysis, Difficulty Level, Discriminant Analysis, Guessing (Tests)
Henk, William A. – 1983
The specific performance characteristics of eight alternative cloze test formats were examined at the fourth and sixth grade levels. At each grade, 64 subjects were randomly assigned to one of four basic treatments (every-fifth/standard, every-fifth/cued, total random/standard, and total random/cued) and tested. Responses on each of the cloze…
Descriptors: Cloze Procedure, Comparative Analysis, Grade 4, Grade 6
Milton, Ohmer – 1982
Educators are called upon to improve the quality of classroom tests to enhance the learning of content. Less faculty concern for tests than for other features of instruction, compounded by a lack of knowledge about how to assess different levels of learning with test questions that measure complex processes, appears to generate poor quality classroom…
Descriptors: Educational Testing, Evaluation Methods, Higher Education, Learning Activities
Kingston, Neal; Turner, Nancy – 1984
This investigation examines the impact of the 1981 Graduate Record Examination (GRE) General Test format revision on the stability over time of the verbal, quantitative, and analytical scores. Scores were used from the self-selected group of repeaters who took the GRE General Test twice between October 1980 and June 1982. Examinees were divided…
Descriptors: College Entrance Examinations, Graduate Study, Higher Education, Multiple Regression Analysis
Marston, Doug; Deno, Stanley – 1981
The reliability of four measures of written expression was examined (total words written, mature words, words spelled correctly, and letters in sequence). Subjects included elementary-age students in several school districts, some of whom were learning disabled. Results revealed high coefficients for test-retest reliability, parallel-form…
Descriptors: Classroom Techniques, Comparative Analysis, Elementary Education, Formative Evaluation
Peer reviewed
Knight, Deborah Forsyth – Journal of Reading, 1985
Reviews the Curriculum Referenced Tests of Mastery, which are intended to measure achievement with an emphasis on what a student has learned rather than on predicting future success in school, and concludes that the tests are worthy of consideration by any district. (HOD)
Descriptors: Academic Achievement, Educational Objectives, Language Arts, Mathematics Instruction
Peer reviewed
Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests
Peer reviewed
Frisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of an MTF item may contain several true options, and the correctness of each is judged independently. MTF tests appear efficient and reliable, although their items are somewhat harder for examinees than multiple choice items. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests
Peer reviewed
Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1994
The reliabilities of 2-, 3-, 4-, and 5-choice tests were compared through an incremental-option model on a test taken by 154 high school seniors. Creating the test forms incrementally more closely approximates actual test construction. The nonsignificant differences among the option choices support the three-option item. (SLD)
Descriptors: Distractors (Tests), Estimation (Mathematics), High School Students, High Schools