Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Test Length | 17 |
Test Validity | 17 |
Testing Problems | 17 |
Test Reliability | 8 |
Test Construction | 7 |
Test Items | 7 |
Test Format | 6 |
Elementary Secondary Education | 5 |
Higher Education | 4 |
Item Analysis | 4 |
Achievement Tests | 3 |
More ▼ |
Source
Applied Psychological… | 1 |
Educational Research and… | 1 |
Educational and Psychological… | 1 |
Language Testing | 1 |
Author
Publication Type
Reports - Research | 7 |
Speeches/Meeting Papers | 7 |
Information Analyses | 3 |
Journal Articles | 3 |
Reports - Evaluative | 3 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 5 |
Location
Japan | 1 |
New Jersey | 1 |
United Kingdom | 1 |
Vermont | 1 |
Laws, Policies, & Programs
Assessments and Surveys
General Educational… | 1 |
National Assessment of… | 1 |
Stanford Achievement Tests | 1 |
Test of English as a Foreign… | 1 |
Wechsler Intelligence Scale… | 1 |
Wechsler Intelligence Scales… | 1 |
What Works Clearinghouse Rating
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Modjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1978
The General Education Performance Index (GEPI) is a comparatively short test covering the same content as the General Educational Development Test (GED), which takes ten hours to administer. Correlations of the subtests of the GEPI with the GED ranged from .28 to .57. (JKS)
Descriptors: Correlation, Equivalency Tests, Military Personnel, Statistical Data
Lutz, William – 1983
After an extensive review of the available research on large-scale writing assessment, certain issues in writing assessment seem to be unresolved, and still other issues are not supported by adequate research. This paper reviews the basic issues in writing assessment, points out which topics are supported by strong research, and which topics are…
Descriptors: Educational Assessment, Essay Tests, Higher Education, Multiple Choice Tests
Graham, Darol L. – 1974
The adequacy of a test developed for statewide assessment of basic mathematics skills was investigated. The test, comprised of multiple-choice items reflecting a series of behavioral objectives, was compared with a more extensive criterion measure generated from the same objectives by the application of a strict item sampling model. In many…
Descriptors: Comparative Testing, Criterion Referenced Tests, Educational Assessment, Item Sampling

Kafry, Ditsa; And Others – Applied Psychological Measurement, 1979
A series of behavioral expectation scale applications were analyzed in an attempt to point out an appropriate number of dimensions to be included in such studies. Results reflected the problems of dimension interdependence when the number of dimensions exceeds nine. (Author/JKS)
Descriptors: Behavior Rating Scales, Expectation, Factor Analysis, Higher Education
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Jolly, S. Jean; And Others – 1985
Scores from the Stanford Achievement Tests administered to 50,000 students in Palm Beach County, Florida, were studied in order to determine whether the speeded nature of the reading comprehension subtest was related to inconsistencies in the score profiles. Specifically, the probable effect of random guessing was examined. Reading scores were…
Descriptors: Achievement Tests, Elementary Secondary Education, Guessing (Tests), Item Analysis
Oosterhof, Albert C.; Coats, Pamela K. – 1981
Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…
Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education
Hopper, Margaret F. – 2001
This paper provides an overview of the types of testing accommodations used for students with disabilities and presents arguments for and against their use. It begins by discussing student participation in educational assessments and federal requirements concerning the participation of students with disabilities. The types of accommodations are…
Descriptors: Academic Accommodations (Disabilities), Academic Standards, Disabilities, Educational Assessment
Freedman, Sarah Warshauer – 1991
Writing teachers and educators can add to information from large-scale testing and teachers can strengthen classroom assessment by creating a tight fit between large-scale testing and classroom assessment. Across the years, large-scale testing programs have struggled with a difficult problem: how to evaluate student writing reliably and…
Descriptors: Elementary Secondary Education, Foreign Countries, Informal Assessment, Portfolios (Background Materials)
Hambleton, Ronald K. – 1986
The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…
Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics
Boyd, Thomas A.; Tramontana, Michael G. – 1984
To examine the validity of short forms of the Wechsler Intelligence Scale for Children-Revised (WISC-R), the WISC-R was first administered to 106 hospitalized psychiatric patients, aged 8-16. No subjects had a primary diagnosis of mental retardation or learning disability, and one-third were receiving psychotropic medication. WISC-R IQ scores…
Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education
Harnisch, Delwyn L. – 1985
Computer adaptive testing systems are feasible for certification and licensure testing. This is in part due to the availability of extensive yet inexpensive computers. Modern item response theory, combined with computerized adaptive testing, yields a powerful new method of testing which provides greater accuracy and efficiency and less boredom for…
Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Cost Effectiveness
Carifio, James – 1992
Researchers and program evaluators would often like to use a particular instrument, but do not because it is too long or would require too much testing time. Having a validated set of objective procedures for reducing the size of an instrument could improve many research and evaluation efforts. This paper reports the results of test reduction or…
Descriptors: Attitude Measures, Elementary School Students, Factor Analysis, Intermediate Grades
Previous Page | Next Page ยป
Pages: 1 | 2