Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 6 |
Descriptor
Comparative Analysis | 15 |
Comparative Testing | 15 |
Test Reliability | 15 |
Test Validity | 8 |
Reading Tests | 4 |
Reading Comprehension | 3 |
Test Format | 3 |
Bilingual Education | 2 |
Bilingual Students | 2 |
Computer Assisted Testing | 2 |
Difficulty Level | 2 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 9 |
Journal Articles | 8 |
Reports - Descriptive | 2 |
Collected Works - Serials | 1 |
Dissertations/Theses -… | 1 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Elementary Secondary Education | 1 |
Grade 2 | 1 |
High Schools | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Kenya | 1 |
Maryland | 1 |
Texas | 1 |
United Kingdom (England) | 1 |
Utah | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Armed Forces Qualification… | 1 |
Armed Services Vocational… | 1 |
Comprehensive Tests of Basic… | 1 |
New Jersey College Basic… | 1 |
What Works Clearinghouse Rating
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Piper, Benjamin; Zuilkowski, Stephanie Simmons – International Review of Education, 2015
In recent years, the Education for All movement has focused more intensely on the quality of education, rather than simply provision. Many recent and current education quality interventions focus on literacy, which is the core skill required for further academic success. Despite this focus on the quality of literacy instruction in developing…
Descriptors: Foreign Countries, Reading Fluency, Reading Tests, Oral Reading
Jones, Ian; Alcock, Lara – Studies in Higher Education, 2014
Peer assessment typically requires students to judge peers' work against assessment criteria. We tested an alternative approach in which students judged pairs of scripts against one another in the absence of assessment criteria. First year mathematics undergraduates (N?=?194) sat a written test on conceptual understanding of multivariable…
Descriptors: Peer Evaluation, Evaluation Criteria, Alternative Assessment, Undergraduate Students
Bradbury, Alice – Journal of Education Policy, 2011
Despite decades of research and debate, the issue of unequal outcomes continues to be a concern in educational systems worldwide. In England, published data relating to pupils' attainment across ethnic groups and by class indicators has been used to demonstrate continued inequalities in schools. This article attempts to deconstruct the…
Descriptors: Ethnic Groups, Urban Areas, Foreign Countries, Educational Policy
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Manpower Administration (DOL), Washington, DC. U.S. Training and Employment Service. – 1969
To compare the reliability of performance on recorded dictation tests with performance on live tests, 216 university students who were nearing completion of an intermediate shorthand course and 26 job applicants seeking stenographic positions were divided into 10 groups, with five receiving live dictation and five receiving recorded dictation. The…
Descriptors: Comparative Analysis, Comparative Testing, Evaluation, Performance Tests
Vitola, Bart M.; Wilbourn, James M. – 1971
Male and female enlistee samples were compared for total groups and by enlistment region in terms of their performance on the Airman Qualifying Examination (AQE) and the Armed Services Vocational Aptitude Battery (ASVAB). Women in the Air Force (WAF) test-retest performance was evaluated on the Armed Forces Women's Selection Test (AFWST) which is…
Descriptors: Aptitude Tests, Comparative Analysis, Comparative Testing, Military Air Facilities
Silverstein, A. B. – Psychol Rep, 1970
Reappraises the validity and reliability of Vocabulary and Block Design (V-VD) as a short form of the Wechsler Adult Intelligence Scale (WAIS), the Wechsler Intelligence Scale for Children (WISC), and the Wechsler Preschool and Primary Scale of Intelligence (WPPI). Presents a table for converting the sum of scaled scores into an estimate of Full…
Descriptors: Comparative Analysis, Comparative Testing, Grade Equivalent Scores, Intelligence Tests
Sammon, Susan F. – 1988
A study investigated whether a positive correlation existed between scores obtained by incoming freshman on the recently developed Degrees of Reading Power Test (DRP) and the required Reading Comprehension subtest of the New Jersey College Basic Skills Placement Test (NJCBSPT). The subjects, 217 William Paterson College freshman enrolled in a…
Descriptors: Comparative Analysis, Comparative Testing, Correlation, Educational Testing

Allison, Donald E. – Alberta Journal of Educational Research, 1984
Reports that no significant difference in reliability appeared between a heterogeneous and a homogeneous form of the same general science matching-item test administered to 316 sixth-grade students but that scores on the heterogeneous form of the test were higher, independent of the examinee's sex or intelligence. (SB)
Descriptors: Comparative Analysis, Comparative Testing, Elementary Education, Grade 6
Pedigo, Patricia; De Santi, Roger J. – 1986
To determine the most accurate group-administered measure of reading achievement, a study explored variations of the cloze and maze procedures with second grade students who were native English speakers or who were being taught English as a second language. Subjects--108 second grade volunteers (1% American Indian, 49% Asian, 39.8% Black, 1%…
Descriptors: Cloze Procedure, Comparative Analysis, Comparative Testing, Grade 2

Merino, Barbara J.; Spencer, Mary – NABE: The Journal for the National Association for Bilingual Education, 1983
Compares five commonly used English-Spanish language dominance instruments according to area of language measured, domain assessed, developmental comparability, and language variety or dialect. Examines the proficiency information provided and the validity, reliability, and norming of the instruments. Concludes that the tests are not comparable…
Descriptors: Bilingual Education, Bilingual Students, Comparative Analysis, Comparative Testing
Cervantes, Robert A.; Bernal, Helen Hazuda – 1976
A South Texas survey conducted in 1975 investigated the reading performance of Mexican American students enrolled in a bilingual program to determine whether or not students achieved significantly different reading scores on parallel Spanish and English versions of an appropriate test (Guidance Testing Associates Inter-American Test of Reading).…
Descriptors: Bilingual Education, Bilingual Students, Comparative Analysis, Comparative Testing
Benderson, Albert, Ed. – Focus, 1988
The scores of handicapped students taking tests such as the Scholastic Aptitude Test (SAT) or the Graduate Record Examinations are flagged so that admissions officers will be aware that they were achieved under special circumstances. A series of studies was initiated to determine whether special administrations of such tests are comparable to…
Descriptors: Admission Criteria, College Admission, College Entrance Examinations, College Students