ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	6

Descriptor

Comparative Analysis	15
Comparative Testing	15
Test Reliability	15
Test Validity	8
Reading Tests	4
Reading Comprehension	3
Test Format	3
Bilingual Education	2
Bilingual Students	2
Computer Assisted Testing	2
Difficulty Level	2
Elementary Education	2
English	2
Foreign Countries	2
Grade 2	2
Higher Education	2
Interviews	2
Item Analysis	2
Multiple Choice Tests	2
Objective Tests	2
Program Effectiveness	2
Psychometrics	2
Racial Differences	2
Reading Achievement	2
Scores	2
More ▼

Source

Advances in Health Sciences…	1
Alberta Journal of…	1
Focus	1
International Review of…	1
Journal of Applied Testing…	1
Journal of Education Policy	1
Journal of Education for…	1
NABE: The Journal for the…	1
Psychol Rep	1
Studies in Higher Education	1

Publication Type

Reports - Research	9
Journal Articles	8
Reports - Descriptive	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Education	2
Higher Education	2
Postsecondary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 2	1
High Schools	1
Primary Education	1
Secondary Education	1

Audience

Researchers

Location

Kenya	1
Maryland	1
Texas	1
United Kingdom (England)	1
Utah	1

Laws, Policies, & Programs

Assessments and Surveys

Armed Forces Qualification…	1
Armed Services Vocational…	1
Comprehensive Tests of Basic…	1
New Jersey College Basic…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

Does MTV Really Do a Good Job of Evaluating Professors? An Empirical Test of the Internet Site Ratemyprofessors.com

Peer reviewed

Direct link

Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016

Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…

Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis

Assessing Reading Fluency in Kenya: Oral or Silent Assessment?

Peer reviewed

Direct link

Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015

In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…

Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading

Peer Assessment without Assessment Criteria

Peer reviewed

Direct link

Jones, Ian; Alcock, Lara – Studies in Higher Education, 2014

Peer assessment typically requires students to judge peers' work against assessment criteria. We tested an alternative approach in which students judged pairs of scripts against one another in the absence of assessment criteria. First year mathematics undergraduates (N?=?194) sat a written test on conceptual understanding of multivariable…

Descriptors: Peer Evaluation, Evaluation Criteria, Alternative Assessment, Undergraduate Students

Rethinking Assessment and Inequality: The Production of Disparities in Attainment in Early Years Education

Peer reviewed

Direct link

Bradbury, Alice – Journal of Education Policy, 2011

Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…

Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy

The Contribution of Constructed Response Items to Large Scale Assessment: Measuring and Understanding Their Impact

Peer reviewed

Direct link

Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012

This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…

Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics

Study to Compare Reliability of Performance on Live and Recorded Dictation Tests.

Download full text

Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1969

To compare the reliability of performance on recorded dictation tests with performance on live tests, 216 university students who were nearing completion of an intermediate shorthand course and 26 job applicants seeking stenographic positions were divided into 10 groups, with five receiving live dictation and five receiving recorded dictation. The…

Descriptors: Comparative Analysis, Comparative Testing, Evaluation, Performance Tests

Comparative Performance of Male and Female Enlistees on Air Force Selection Measures.

Download full text

Vitola, Bart M.; Wilbourn, James M. – 1971

Male and female enlistee samples were compared for total groups and by enlistment region in terms of their performance on the Airman Qualifying Examination (AQE) and the Armed Services Vocational Aptitude Battery (ASVAB). Women in the Air Force (WAF) test-retest performance was evaluated on the Armed Forces Women's Selection Test (AFWST) which is…

Descriptors: Aptitude Tests, Comparative Analysis, Comparative Testing, Military Air Facilities

Reappraisal of the Validity of a Short Short Form of Wechsler's Scales

Silverstein, A. B. – Psychol Rep, 1970

Reappraises the validity and reliability of Vocabulary and Block Design (V-VD) as a short form of the Wechsler Adult Intelligence Scale (WAIS), the Wechsler Intelligence Scale for Children (WISC), and the Wechsler Preschool and Primary Scale of Intelligence (WPPI). Presents a table for converting the sum of scaled scores into an estimate of Full…

Descriptors: Comparative Analysis, Comparative Testing, Grade Equivalent Scores, Intelligence Tests

A Correlation Study: The New Jersey College Basic Skills Placement Test and Degrees of Reading Power Test.

Sammon, Susan F. – 1988

A study investigated whether a positive correlation existed between scores obtained by incoming freshman on the recently developed Degrees of Reading Power Test (DRP) and the required Reading Comprehension subtest of the New Jersey College Basic Skills Placement Test (NJCBSPT). The subjects, 217 William Paterson College freshman enrolled in a…

Descriptors: Comparative Analysis, Comparative Testing, Correlation, Educational Testing

The Effect of Homogeneous vs. Heterogeneous Matching-Item Format on Test Performance and Reliability.

Peer reviewed

Allison, Donald E. – Alberta Journal of Educational Research, 1984

Reports that no significant difference in reliability appeared between a heterogeneous and a homogeneous form of the same general science matching-item test administered to 316 sixth-grade students but that scores on the heterogeneous form of the test were higher, independent of the examinee's sex or intelligence. (SB)

Descriptors: Comparative Analysis, Comparative Testing, Elementary Education, Grade 6

A Comparative Analysis of Cloze and Maze Performances of Second Grade Children.

Pedigo, Patricia; De Santi, Roger J. – 1986

To determine the most accurate group-administered measure of reading achievement, a study explored variations of the cloze and maze procedures with second grade students who were native English speakers or who were being taught English as a second language. Subjects--108 second grade volunteers (1% American Indian, 49% Asian, 39.8% Black, 1%…

Descriptors: Cloze Procedure, Comparative Analysis, Comparative Testing, Grade 2

The Comparability of English and Spanish Versions of Oral Language Proficiency Instruments.

Peer reviewed

Merino, Barbara J.; Spencer, Mary – NABE: The Journal for the National Association for Bilingual Education, 1983

Compares five commonly used English-Spanish language dominance instruments according to area of language measured, domain assessed, developmental comparability, and language variety or dialect. Examines the proficiency information provided and the validity, reliability, and norming of the instruments. Concludes that the tests are not comparable…

Descriptors: Bilingual Education, Bilingual Students, Comparative Analysis, Comparative Testing

A Comparative Analysis of English and Spanish Reading Performance of Mexican American Students.

Cervantes, Robert A.; Bernal, Helen Hazuda – 1976

A South Texas survey conducted in 1975 investigated the reading performance of Mexican American students enrolled in a bilingual program to determine whether or not students achieved significantly different reading scores on parallel Spanish and English versions of an appropriate test (Guidance Testing Associates Inter-American Test of Reading).…

Descriptors: Bilingual Education, Bilingual Students, Comparative Analysis, Comparative Testing

Testing, Equality, and Handicapped People.

Peer reviewed
PDF on ERIC

Download full text

Benderson, Albert, Ed. – Focus, 1988

The scores of handicapped students taking tests such as the Scholastic Aptitude Test (SAT) or the Graduate Record Examinations are flagged so that admissions officers will be aware that they were achieved under special circumstances. A series of studies was initiated to determine whether special administrations of such tests are comparable to…

Descriptors: Admission Criteria, College Admission, College Entrance Examinations, College Students

Alcock, Lara	1
Allison, Donald E.	1
Bauer, Daniel	1
Benderson, Albert, Ed.	1
Bernal, Helen Hazuda	1
Bradbury, Alice	1
Cervantes, Robert A.	1
De Santi, Roger J.	1
Fischer, Martin R.	1
Guttormsen, Sissel	1
Hou, Xiaodong	1
Huwendiek, Sören	1
Jones, Ian	1
Krebs, René	1
Lahner, Felicitas-Maria	1
Lissitz, Robert W.	1
Lörwald, Andrea Carolin	1
Merino, Barbara J.	1
Murray, Keith B.	1
Nouns, Zineb Miriam	1
Pedigo, Patricia	1
Piper, Benjamin	1
Sammon, Susan F.	1
Silverstein, A. B.	1
Slater, Sharon Cadman	1
More ▼