Publication Date
| Date range | Records |
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 45 |
| Since 2017 (last 10 years) | 91 |
| Since 2007 (last 20 years) | 144 |
Descriptor
| Descriptor | Records |
| Test Format | 418 |
| Test Reliability | 418 |
| Test Validity | 243 |
| Test Construction | 135 |
| Test Items | 119 |
| Higher Education | 88 |
| Multiple Choice Tests | 68 |
| Foreign Countries | 67 |
| Testing | 65 |
| Test Interpretation | 61 |
| Comparative Analysis | 57 |
Audience
| Audience | Records |
| Practitioners | 33 |
| Teachers | 23 |
| Administrators | 18 |
| Researchers | 12 |
| Community | 1 |
| Counselors | 1 |
| Policymakers | 1 |
| Students | 1 |
| Support Staff | 1 |
Location
| Location | Records |
| New York | 9 |
| Turkey | 8 |
| California | 7 |
| Canada | 6 |
| Japan | 6 |
| Germany | 4 |
| United Kingdom | 4 |
| Georgia | 3 |
| Israel | 3 |
| France | 2 |
| Indonesia | 2 |
Laws, Policies, & Programs
| Law, Policy, or Program | Records |
| Individuals with Disabilities… | 1 |
| Job Training Partnership Act… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Peer reviewed: Aiken, Lewis R. – Educational and Psychological Measurement, 1983
Each of six forms of a 10-item teacher evaluation rating scale, having two to seven response categories per form, was administered to over 100 college students. Means of item responses and item variances increased with the number of response categories. Internal consistency of total scores did not change systematically. (Author/PN)
Descriptors: College Students, Higher Education, Item Analysis, Rating Scales
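The internal-consistency index this abstract refers to is conventionally coefficient (Cronbach's) alpha; the formula below is the standard textbook definition, quoted here for context rather than taken from Aiken's paper:

```latex
\alpha \;=\; \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_i^{2}}{\sigma_X^{2}}\right)
```

Here k is the number of items, \sigma_i^{2} the variance of item i, and \sigma_X^{2} the variance of the total score. Because alpha depends on the ratio of summed item variances to total-score variance rather than on their absolute size, larger item variances do not by themselves move it, which is consistent with the reported finding that internal consistency did not change systematically.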
Peer reviewed: Hsu, Louis M. – Applied Psychological Measurement, 1979
A comparison of the relative ordering power of separate and grouped-item true-false tests indicated that neither type of test was uniformly superior to the other across all levels of knowledge of examinees. Grouped-item tests were found superior for examinees with low levels of knowledge. (Author/CTM)
Descriptors: Academic Ability, Knowledge Level, Multiple Choice Tests, Scores
Peer reviewed: Lubin, Bernard; Van Whitlock, Rod – Assessment, 1996
The reliability and validity of the positive and negative mood scales of the trait version of the State-Trait Depression Adjective Check Lists (D. Watson and others, 1988) were supported with 269 college students, 197 adolescents, and 165 older adults. Results provide evidence of the equivalence of the positive and negative scales. (SLD)
Descriptors: Adolescents, College Students, Depression (Psychology), High Schools
Peer reviewed: Woodruff, David J.; Sawyer, Richard L. – Applied Psychological Measurement, 1989
Two methods--non-distributional and normal--are derived for estimating measures of pass-fail reliability. Both are based on the Spearman-Brown formula and require only a single test administration. Results from a simulation (n=20,000 examinees) and a licensure examination (n=4,828 examinees) illustrate these methods. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Licensing Examinations (Professions), Measures (Individuals)
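The Spearman-Brown formula the abstract mentions is the standard prophecy formula; in its textbook form (given here for context, not as the authors' derivation):

```latex
\rho^{*} \;=\; \frac{k\,\rho_{xx'}}{1 + (k-1)\,\rho_{xx'}}
```

where \rho_{xx'} is the reliability of the original test and k the factor by which test length changes. With k = 2 it converts a half-test (split-half) correlation into a full-length reliability estimate, which is why estimates of this kind can be obtained from a single test administration.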
Peer reviewed: Johnson, Nancy E.; And Others – Assessment, 1994
Development of an alternate form of Raven's Standard Progressive Matrices Test is described. Reliability analysis with 449 children of differing racial/ethnic backgrounds showed good reliability and comparable predictive validity. The alternate form is a promising research tool. (SLD)
Descriptors: Children, Ethnic Groups, Intelligence Tests, Matrices
Peer reviewed: Hart, David K. – Slavic and East European Journal, 1994
Reports on a series of tests made to determine whether a correlation exists between modes of testing and the ability of Russian language students to place stress correctly. Contrary to the hypothesis, it was found that a significant variation did occur in the results obtained both for test type and test modality. (16 references) (MDM)
Descriptors: College Students, Higher Education, Language Skills, Russian
Peer reviewed: Kobak, Kenneth A.; And Others – Psychological Assessment, 1993
A computer-administered form of the Hamilton Anxiety Scale was developed and, together with the clinician-administered form of the instrument, given to 214 psychiatric outpatients and 78 community adults. Results support the reliability and validity of the computer-administered version as an alternative to the clinician-administered version. (SLD)
Descriptors: Adults, Anxiety, Clinical Diagnosis, Comparative Testing
Pike, Gary R. – 1994
This paper examines the proposed use of student self-report data as proxies for College Basic Academic Subjects Examination (College BASE) scores and as policy indicators of good educational practice. A recent study by the National Center for Higher Education Management Systems had recommended this use of student self-reports. For this study 540…
Descriptors: Achievement Tests, College Outcomes Assessment, College Seniors, Comparative Analysis
Martin, Randy – 1988
Reasons for administering tests fall into two categories--decision-making and promoting learning. The two bases of tests are learning objectives and the level of learning at which training is developed. Test development involves a number of steps. The best way to tie objectives to test items is through the use of a table of specifications, which…
Descriptors: Elementary Secondary Education, Item Analysis, Item Banks, Postsecondary Education
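As a rough illustration of the table-of-specifications idea mentioned above (the objectives, levels of learning, and item counts below are hypothetical, not Martin's), a specification can be kept as a simple objective-by-level tally:

```python
# Hypothetical table of specifications: planned item counts indexed by
# (learning objective, level of learning). All entries are illustrative.
from collections import defaultdict

spec = {
    ("Define key terms",         "knowledge"):     6,
    ("Explain the procedure",    "comprehension"): 8,
    ("Apply it to a new case",   "application"):  10,
    ("Critique a sample result", "analysis"):      6,
}

by_objective = defaultdict(int)
by_level = defaultdict(int)
for (objective, level), n_items in spec.items():
    by_objective[objective] += n_items
    by_level[level] += n_items

print("Total items:", sum(spec.values()))   # 30 in this example
for objective, n in by_objective.items():
    print(f"  {objective:<26} {n:>3}")
for level, n in by_level.items():
    print(f"  {level:<26} {n:>3}")
```

Laying the blueprint out this way makes it easy to check that every objective is sampled and that item counts reflect the emphasis given in instruction before any items are written.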
Brown, William R. – 1988
The evaluation tools written by teachers are rarely valid or reliable. One teaching aid that can help in the creation of an effective evaluation instrument is called a test map. A test map is a systematic method for considering the variables that are important in constructing the format of a test. Five variables that are discussed in the test…
Descriptors: Elementary Secondary Education, Evaluation Methods, Higher Education, Student Evaluation
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis
Case, Susan M.; And Others – 1988
An item format incorporating pattern recognition was designed to assess medical students' abilities in the area of clinical diagnosis. A group of approximately 20 faculty members of five New England medical schools met in Worcester for half of a day to develop pattern recognition items. Teams of four to six physicians were assigned to work on…
Descriptors: Clinical Diagnosis, Higher Education, Item Analysis, Medical Evaluation
Hensley, Wayne E. – 1982
Because item order and salience may affect the findings of social science research, a study was conducted to determine the effect of topic salience on subject response to different item orders. In the study, 10 high salience self-esteem items were presented with 10 low salience items concerning product labels in three versions. In the first…
Descriptors: College Students, Communication Research, Higher Education, Item Analysis
Haladyna, Tom; Roid, Gale – 1981
Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…
Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction
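The latent trait approach mentioned in this abstract is normally formalized with an item response model; in the one-parameter (Rasch) case the probability of a correct response is (standard form, not specific to this paper):

```latex
P(X_{ij} = 1 \mid \theta_j, b_i) \;=\; \frac{e^{\,\theta_j - b_i}}{1 + e^{\,\theta_j - b_i}}
```

where \theta_j is the examinee's achievement level and b_i the item's difficulty. An item is most informative when b_i is close to \theta_j, which is the rationale for matching item difficulty to the achievement level of the student rather than sampling items at random from the domain.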
Peer reviewed: Frary, Robert B. – Journal of Educational Measurement, 1985
Responses to a sample test were simulated for examinees under free-response and multiple-choice formats. Test score sets were correlated with randomly generated sets of unit-normal measures. The extent of superiority of free response tests was sufficiently small so that other considerations might justifiably dictate format choice. (Author/DWH)
Descriptors: Comparative Analysis, Computer Simulation, Essay Tests, Guessing (Tests)
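A minimal sketch of the kind of simulation described above, with illustrative parameter choices (40 items, four-option guessing, a fixed criterion loading) that are assumptions rather than Frary's actual design:

```python
# Simulate free-response and multiple-choice scores for the same examinees
# and compare their correlations with a unit-normal criterion measure.
# Test length, guessing rate, and criterion loading are assumptions.
import numpy as np

rng = np.random.default_rng(0)
n_examinees, n_items, n_options = 2000, 40, 4

theta = rng.standard_normal(n_examinees)                        # latent proficiency
difficulty = rng.standard_normal(n_items)
p_know = 1.0 / (1.0 + np.exp(-(theta[:, None] - difficulty)))   # P(examinee knows item)

knows = rng.random((n_examinees, n_items)) < p_know
free_response = knows.sum(axis=1)                               # credit only when known

guesses = rng.random((n_examinees, n_items)) < 1.0 / n_options
multiple_choice = (knows | guesses).sum(axis=1)                 # lucky guesses add credit

# Unit-normal criterion loading 0.6 on proficiency (0.6**2 + 0.8**2 = 1).
criterion = 0.6 * theta + 0.8 * rng.standard_normal(n_examinees)

print("free-response vs criterion:  ", round(float(np.corrcoef(free_response, criterion)[0, 1]), 3))
print("multiple-choice vs criterion:", round(float(np.corrcoef(multiple_choice, criterion)[0, 1]), 3))
```

Comparing the two correlations across replications gives the kind of format comparison the abstract summarizes.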


