NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Researchers1
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Rutkowski, David; Rutkowski, Leslie; Plucker, Jonathan A. – Phi Delta Kappan, 2015
The OECD and its U.S. administrator, McGraw-Hill Education CTB, have recently concluded the first cycle of the OECD-Test for Schools in the U.S. This test is being marketed to local schools and is designed to compare 15-year-olds from individual participating schools against peers nationally and internationally using the OECD's PISA test as its…
Descriptors: Participation, International Education, Comparative Testing, Comparative Education
Peer reviewed Peer reviewed
Direct linkDirect link
Turgut, Guliz – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2013
The ranking of the United States in major international tests such as the Progress in International Reading Literacy Study (PIRLS), Trends in International Mathematics and Science Study (TIMSS), and Program for International Student Assessment (PISA) is used as the driving force and rationale for the current educational reforms in the United…
Descriptors: Educational Change, Success, Educational Strategies, Educational Indicators
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010
Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests
Green, Kathy E. – 1989
The psychometric utility of six experimental cognitive style (CS) measures was analyzed. Examinees were 1,135 clients of the Johnson O'Connor Research Foundation who, during 1985, completed at least one of the six CS tests. Information is provided on measure reliability; relationships among CS measures; relationships with standard battery aptitude…
Descriptors: Age Differences, Aptitude Tests, Cognitive Measurement, Cognitive Style
Pedigo, Patricia; De Santi, Roger J. – 1986
To determine the most accurate group-administered measure of reading achievement, a study explored variations of the cloze and maze procedures with second grade students who were native English speakers or who were being taught English as a second language. Subjects--108 second grade volunteers (1% American Indian, 49% Asian, 39.8% Black, 1%…
Descriptors: Cloze Procedure, Comparative Analysis, Comparative Testing, Grade 2
Avery, Richard O.; And Others – Education and Training in Mental Retardation, 1989
Scores on the Wechsler Intelligence Scale for Children-Revised (WISC-R) and Wechsler Adult Intelligence Scale-Revised (WAIS-R) Verbal, Performance, and Full Scales were compared for 26 adolescents with educable mental handicaps. The WAIS-R, while strongly correlated with the WISC-R, provided higher scores on all three scales. Several WISC-R…
Descriptors: Adolescents, Comparative Testing, Intelligence Quotient, Intelligence Tests
Wiser, Berton; Lenke, Joanne M. – 1987
The extent to which national performance on an achievement test battery changed over a 4-year period was studied, and the comparative performance of students in school districts taking the test series for the first time was compared with that of students in districts that had used the battery at least once in one or more grades. The seventh…
Descriptors: Achievement Gains, Achievement Tests, Comparative Testing, Elementary School Students
Ross, G. Robert – 1977
A set of eight widely used inductive reasoning tests were investigated to determine whether or not they have different factorial structures. The eight inductive tests and three deductive tests, taken from the French Kit of Reference Tests for Cognitive Factors and the Watson-Glaser Critical Thinking Appraisal, were administered to 157 high school…
Descriptors: Abstract Reasoning, Cognitive Tests, Comparative Testing, Deduction
Guidance Testing Associates, Austin, TX. – 1967
The purpose of this technical report is to describe the Tests of General Ability and Tests of Reading of the Inter-American Series, to give a brief account of their construction, and to present related statistical data. Norms and suggestions on the use and interpretation of the tests are published separately. The Inter-American Series discussed in…
Descriptors: Achievement Tests, Aptitude Tests, Biculturalism, Bilingualism
Crowder, Christopher R.; Gallas, Edwin J. – 1978
Both on-level and out-of-level tests were administered to third and fifth grade children in order to compare the scaled scores of different level tests of the same testing program and to discover whether the relationship between levels might be distorted by ceiling or floor effects. Only reading tests were used in this study. The Stanford…
Descriptors: Achievement Tests, Comparative Testing, Difficulty Level, Elementary Education
Peer reviewed Peer reviewed
Jones, Allan – Journal of Geography in Higher Education, 1997
Examines the increase in popularity of objective testing in the United Kingdom and addresses some of the accompanying academic issues. Reports on a case study of test production and implementation to illustrate issues of time costs and benefits. Discusses question styles, marking schemes, and the problem of guesswork. (MJP)
Descriptors: Comparative Testing, Educational Practices, Educational Trends, Foreign Countries
Slaughter, Helen B.; Gallas, Edwin J. – 1978
Concern was expressed for the possible effects of testing Elementary Secondary Education Act (ESEA) Title I students with norm-referenced tests that may be so difficult that many students will have scores in the chance range. The likelihood of such students obtaining equal scaled scores if they were tested with easier out-of-level tests was…
Descriptors: Achievement Tests, Comparative Testing, Disadvantaged Youth, Equated Scores
Powers, Stephen; Gallas, Edwin J. – 1978
Fourth, seventh, and ninth grade students in Elementary Secondary Education Act (ESEA) Title I programs were tested with the reading comprehension subtests of the Comprehensive Tests of Basic Skills, at each of two levels: on-level for each respective grade, and an easier out-of-level form. Approximately half of these students were found to be…
Descriptors: Achievement Tests, Comparative Testing, Compensatory Education, Difficulty Level