ERIC - Search Results

Filter: Test Reliability

Showing 241 to 255 of 418 results

An Alternative to Essay Examinations: The Interpretive Exercise. How to Minimize Test Time and Save Dollars.

Kemerer, Richard; Wahlstrom, Merlin – Performance and Instruction, 1985

Compares the features, learning outcomes tested, reliability, viability, and cost effectiveness of essay tests with those of interpretive tests used in training programs. A case study showing how an essay test was converted to an interpretive test and pilot tested illustrates the advantages of interpretive testing. (MBR)

Descriptors: Case Studies, Comparative Analysis, Cost Effectiveness, Essay Tests

The Effect of Homogeneous vs. Heterogeneous Matching-Item Format on Test Performance and Reliability.

Peer reviewed

Allison, Donald E. – Alberta Journal of Educational Research, 1984

Reports that no significant difference in reliability appeared between a heterogeneous and a homogeneous form of the same general science matching-item test administered to 316 sixth-grade students but that scores on the heterogeneous form of the test were higher, independent of the examinee's sex or intelligence. (SB)

Descriptors: Comparative Analysis, Comparative Testing, Elementary Education, Grade 6

Measurement Issues in Certification.

Shrock, Sharon A.; Foshay, Wellesley R. – Performance and Instruction, 1984

Discusses methods of sampling the best information from instruction/training developers/candidates for professional certification and examines the problems of interpreting that information and making classification decisions. Assessment strategies including criterion-referenced, multiple-choice, short answer, and essay questions, and portfolio…

Descriptors: Certification, Competence, Criterion Referenced Tests, Instructional Development

Effect of Item Context on Students' Global Appraisals of Instruction.

Peer reviewed

Scott, Owen; Hsu, Yi-Ming – Perceptual and Motor Skills, 1982

Based on data from classes of 23 instructors in two institutions, it was concluded that specific items on an appraisal-of-instruction inventory probably do not influence students' global appraisals of instruction. If true, this conclusion has important implications for use of such inventories in appraising the effectiveness of college instruction.…

Descriptors: College Instruction, Course Evaluation, Higher Education, Student Evaluation of Teacher Performance

The Effects of Guessing and Item Dependence on the Reliability and Validity of Recognition Based Cloze Tests.

Peer reviewed

Baldauf, Richard B., Jr. – Educational and Psychological Measurement, 1982

A Monte Carlo design examined how the effects of guessing and item dependence influence test characteristics and student scores. Although validity for cloze variants was high, multiple-choice cloze had significantly lower reliabilities than did true score equivalents. (Author/PN)

Descriptors: Cloze Procedure, Elementary Education, Guessing (Tests), Reading Comprehension

Developments in Language Testing.

Peer reviewed

Douglas, Dan – Annual Review of Applied Linguistics, 1995

Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, and new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…

Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests

The Reliability, Validity, and Evaluation of the Objective Structured Clinical Examination in Podiatry (Chiropody).

Peer reviewed

Woodburn, Jim; Sutcliffe, Nick – Assessment & Evaluation in Higher Education, 1996

The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…

Descriptors: Clinical Teaching (Health Professions), Higher Education, Medical Education, Podiatry

Equivalence Reliability of the Split-Half WISC-R Object Assembly Subtest in a Cohort of Juvenile Offenders.

Rodriguez-Aragon, Graciela; And Others – 1993

The predictive power of the Split-Half version of the Wechsler Intelligence Scale for Children--Revised (WISC-R) Object Assembly (OA) subtest was compared to that of the full administration of the OA subtest. A cohort of 218 male and 49 female adolescent offenders detained in a Texas juvenile detention facility between 1990 and 1992 was used. The…

Descriptors: Adolescents, Cohort Analysis, Comparative Testing, Correlation

Measuring Telephone Apprehension.

Steele, Cam Monroe; Reinsch, N. L., Jr. – 1983

An instrument for measuring telephone apprehension was developed to facilitate research into hypothesized relationships between communication apprehension and telephone apprehension. A set of 92 Likert-type items was adapted from previous communication apprehension scales and administered to 81 undergraduate students in a speech communication…

Descriptors: Adults, Attitude Measures, Communication Apprehension, Communication Research

Guidelines for Developing Diagnostic Tests. Methodology Project.

Herman, Joan – 1984

Diagnostic testing can provide specific information about student skills as a decision-making aid to teachers in prescribing instruction, identifying needs for remediation, determining effective instructional materials and methods, and ultimately, improving student learning. Diagnostic testing, as viewed here, includes individual and group…

Descriptors: Diagnostic Tests, Elementary Secondary Education, Skill Analysis, Student Evaluation

Teaching and Assessing Clinical Skills Using a Modified Essay Examination. Teaching Activity Poster.

Brown, Stephen W. – 1987

A "modified essay examination" was used to help teach and to assess clinical problem-solving skills with 11 first trimester doctoral students. This examination provided a paper-and-pencil simulation of problems encountered in case management. Students were required to generate hypotheses, formulate questions, discuss issues, and make…

Descriptors: Case Records, Clinical Experience, Clinical Psychology, Essay Tests

Methodological Issues Related to the Study of Context Effects in Multisection Tests.

Stewart, E. Elizabeth – 1981

Context effects are defined as being influences on test performance associated with the content of successively presented test items or sections. Four types of context effects are identified: (1) direct context effects (practice effects) which occur when performance on items is affected by the examinee having been exposed to similar types of…

Descriptors: Context Effect, Data Collection, Error of Measurement, Evaluation Methods

Standards for Evaluating Criterion-Referenced Tests.

Walker, Clinton B. – 1978

Standards for evaluating criterion-referenced tests are presented. Twenty-one standards, grouped in three categories, are discussed. Category one is defined as measurement properties and is comprised of conceptual validity, including description of the domain, test item agreement with objectives, and item representativeness of the objectives; and…

Descriptors: Course Objectives, Criterion Referenced Tests, Evaluation Criteria, Scoring

Test Review: The Stanford Writing Assessment Program.

Peer reviewed

Onore, Cynthia S. – Journal of Reading, 1986

Reviews the Stanford Writing Assessment Program that has three intended uses: district-wide survey of students' writing ability, diagnosis of classroom or district-wide instructional strengths and weaknesses, and staff development through training for administering and scoring of writing samples. Notes that of these uses, none is necessarily best…

Descriptors: Educational Assessment, Educational Diagnosis, Evaluation Methods, Staff Development

The Relative Merits of Multiple True-False Achievement Tests.

Peer reviewed

Frisbie, David A.; Sweeney, Daryl C. – Journal of Educational Measurement, 1982

A 100-item, five-choice multiple-choice (MC) biology final exam was converted to multiple true-false (MTF) form to yield two content-parallel test forms composed of the two item types. Students found the MTF items easier and preferred MTF over MC; the MTF subtests were more reliable. (Author/GK)

Descriptors: Biology, College Science, Comparative Analysis, Difficulty Level
