ERIC - Search Results

Filter: Test Reliability

Showing 151 to 165 of 418 results

A Mexican Version of the Peabody Picture Vocabulary Test.

Peer reviewed

Simon, Alan J.; Joiner, Lee M. – Journal of Educational Measurement, 1976

The purpose of this study was to determine whether a Mexican version of the Peabody Picture Vocabulary Test could be improved by directly translating both forms of the American test, then using decision procedures to select the better item of each pair. The reliability of the simple translations suffered. (Author/BW)

Descriptors: Early Childhood Education, Spanish, Test Construction, Test Format

A Closer Look at Three Measures of English Morphology.

Peer reviewed

Weber, Ronald L. – Journal of Learning Disabilities, 1982

Three measures often used with handicapped children (the Berry-Talbott Comprehension of Grammar, the Grammatic Closure subtest of the Illinois Test of Psycholinguistic Abilities, and the Grammatic Completion subtest of the Test of Language Development) are discussed in terms of test reliability, scoring procedures, format, and types of scores.…

Descriptors: Disabilities, Language Tests, Morphology (Languages), Nonstandard Dialects

Format Effects in Two Teacher Rating Scales of Hyperactivity.

Peer reviewed

Sandoval, Jonathan – Journal of Abnormal Child Psychology, 1981

The object of the study was to investigate the effect of differences in format on the precision of teacher ratings and thus on the reliability and validity of two teacher rating scales of children's hyperactive behavior. Attributes assessed were motor restlessness, inattentiveness, impulsivity, and aggressiveness/emotional stability. (Author/DB)

Descriptors: Behavior Rating Scales, Elementary Secondary Education, Hyperactivity, Test Format

Composite Reliability and Standard Errors of Measurement for a Seven-Subtest Short Form of the Wechsler Adult Intelligence Scale-Revised.

Peer reviewed

Schretlen, David; And Others – Psychological Assessment, 1994

Composite reliability and standard errors of measurement were computed for prorated Verbal, Performance, and Full-Scale intelligence quotient (IQ) scores from a seven-subtest short form of the Wechsler Adult Intelligence Scale-Revised. Results with 1,880 adults (standardization sample) indicate that this form is as reliable as the complete test.…

Descriptors: Adults, Error of Measurement, Intelligence, Intelligence Quotient

Grading Distractor-Identification Tests.

Peer reviewed

Austin, Joe Dan – Psychometrika, 1981

On distractor-identification tests, students mark as many distractors as possible on each test item. A grading scale is developed for this type of testing. The score is optimal in that it yields an unbiased estimate of the student's score as if no guessing had occurred. (Author/JKS)

Descriptors: Guessing (Tests), Item Analysis, Measurement Techniques, Scoring Formulas

What's Your Classroom Testing Validity Quotient?

Murphy, Meg – School Shop, 1981

Suggests three techniques for assuring the content validity of classroom/shop tests: build a bank of content-valid test items; develop valid tests based on a carefully prepared table of specifications; and check the validity of tests already developed. A self-test is included for the reader. (CT)

Descriptors: Item Banks, Test Construction, Test Format, Test Reliability

Selected Psychometric Characteristics of the Peabody Mathematics Readiness Test.

Peer reviewed

Gilley, William F.; And Others – Psychology: A Journal of Human Behavior, 1988

Administered Peabody Mathematics Readiness Test to 325 students in kindergarten through second grade to investigate selected psychometric characteristics of the test. Found low item-to-item correlations; results did not support factor structure suggested by test's authors or proposed hierarchical structure. (Author/NB)

Descriptors: Factor Structure, Learning Readiness, Mathematics, Primary Education

Estimating the Reliability of a Test Containing Multiple Item Formats.

Peer reviewed

Qualls, Audrey L. – Applied Measurement in Education, 1995

Classically parallel, tau-equivalently parallel, and congenerically parallel models representing various degrees of part-test parallelism and their appropriateness for tests composed of multiple item formats are discussed. An appropriate reliability estimate for a test with multiple item formats is presented and illustrated. (SLD)

Descriptors: Achievement Tests, Estimation (Mathematics), Measurement Techniques, Test Format

Development of an Interview-Based Geriatric Depression Rating Scale.

Peer reviewed

Jamison, Christine; Scogin, Forrest – International Journal of Aging and Human Development, 1992

Developed interview-based Geriatric Depression Rating Scale (GDRS) and administered 35-item GDRS to 68 older adults with range of affective disturbance. Found scale to have internal consistency and split-half reliability comparable to those of Hamilton Rating Scale for Depression and Geriatric Depression Scale. Concurrent validity, construct…

Descriptors: Depression (Psychology), Geriatrics, Interviews, Older Adults

Comparing Paper-Pencil and Computer-Based Versions of the Harrington-O'Shea Career Decision-Making System.

Peer reviewed

Kapes, Jerome T.; Vansickle, Timothy R. – Measurement and Evaluation in Counseling and Development, 1992

Examined equivalence of mode of administration of the Career Decision-Making System, comparing paper-and-pencil version and computer-based version. Findings from 61 undergraduate students indicated that the computer-based version was significantly more reliable than paper-and-pencil version and was generally equivalent in other respects.…

Descriptors: Comparative Testing, Computer Assisted Testing, Higher Education, Test Format

Metric Equivalence of the Bidimensional Acculturation Scale, the Satisfaction with Life Scale, and the Self-Construal Scale across Spanish and English Language Versions

Peer reviewed

Singelis, Theodore M.; Yamada, Ann Marie; Barrio, Concepcion; Laney, Joshua Harrison; Her, Pa; Ruiz-Anaya, Alejandrina; Lennertz, Sara Terwilliger – Hispanic Journal of Behavioral Sciences, 2006

The metric equivalence of translated scales is often in question but seldom examined. This study presents test-retest data that support the metric equivalence of the Spanish and English language versions of three measures: the Bidimensional Acculturation Scale, the Satisfaction with Life Scale, and the Self-Construal Scale. Participants were…

Descriptors: Acculturation, Life Satisfaction, English, Test Format

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

A Comparison of the Item Difficulty and Item Discrimination of Multiple-Choice Items Using the "None of the Above" and One Correct Response Options.

Peer reviewed

Tollefson, Nona – Educational and Psychological Measurement, 1987

This study compared the item difficulty, item discrimination, and test reliability of three forms of multiple-choice items: (1) one correct answer; (2) "none of the above" as a foil; and (3) "none of the above" as the correct answer. Twelve items in the three formats were administered in a college statistics examination. (BS)

Descriptors: Difficulty Level, Higher Education, Item Analysis, Multiple Choice Tests

Effects of Modified Deletion Strategies and Scoring Procedures on Cloze Test Performance.

Peer reviewed

Henk, William A. – Journal of Reading Behavior, 1981

Analyzes alternative cloze forms derived from selected deletion strategies, scoring procedures, and blank conditions for respective effects on the cloze test performance of college-level readers. (HOD)

Descriptors: Cloze Procedure, College Students, Higher Education, Reading Research

Bruininks-Oseretsky Test of Motor Proficiency: A Viable Measure for 3- to 5-Yr.-Old Children.

Peer reviewed

Beitel, Patricia A.; Mead, Barbara J. – Perceptual and Motor Skills, 1980

Examined the short form and eight subtests of the Bruininks-Oseretsky Test of Motor Proficiency with a sample of preschoolers to assess its potential for discriminating among ages and between sexes and to see whether the short form accounted for a major portion of the variability of the complete battery. (Author/SJL)

Descriptors: Age Differences, Perceptual Motor Coordination, Performance Tests, Sex Differences
