Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 54 |
| Since 2017 (last 10 years) | 97 |
| Since 2007 (last 20 years) | 163 |
Descriptor
| Test Format | 506 |
| Test Validity | 506 |
| Test Reliability | 243 |
| Test Construction | 180 |
| Test Items | 127 |
| Foreign Countries | 108 |
| Language Tests | 96 |
| Higher Education | 86 |
| Testing | 80 |
| Computer Assisted Testing | 72 |
| Test Use | 67 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 60 |
| Postsecondary Education | 50 |
| Secondary Education | 30 |
| Elementary Education | 25 |
| Middle Schools | 19 |
| Junior High Schools | 15 |
| High Schools | 13 |
| Grade 8 | 11 |
| Grade 4 | 9 |
| Elementary Secondary Education | 8 |
| Grade 5 | 8 |
| More ▼ | |
Audience
| Practitioners | 30 |
| Teachers | 19 |
| Administrators | 17 |
| Researchers | 9 |
| Community | 1 |
| Policymakers | 1 |
| Students | 1 |
| Support Staff | 1 |
Location
| Canada | 10 |
| China | 9 |
| New York | 9 |
| Japan | 7 |
| Netherlands | 6 |
| Germany | 5 |
| Turkey | 5 |
| United Kingdom | 5 |
| United Kingdom (England) | 5 |
| Australia | 4 |
| Georgia | 4 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| Job Training Partnership Act… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Herman, Joan – 1984
This resource paper is a guide for planning and developing instructionally relevant tests of student learning at the classroom, building, or district level. It is based on a model of instruction and testing which systematically uses assessment information to support and facilitate instructional improvement. Course goals and objectives are first…
Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Instructional Development, Models
White, Karl; And Others – 1981
To explain discrepancies in Utah's elementary school test results under the Elementary and Secondary Education Act's Title I Evaluation and Reporting System (TIERS), researchers investigated the adequacy and validity of TIERS evaluation models. Model A (norm-referenced testing) is used in most Utah school districts, in preference to Models B or C…
Descriptors: Achievement Tests, Elementary Education, Evaluation Methods, Norm Referenced Tests
Pike, Lewis W. – 1979
This evaluative and developmental study was undertaken between 1972-74 to determine the effectiveness of items used for the Test of English as a Foreign Language (TOEFL) in relationship to other item types used in assessing English proficiency, and to recommend possible changes in TOEFL content and format. TOEFL was developed to assess the English…
Descriptors: Cloze Procedure, English (Second Language), Essay Tests, Foreign Students
Salvia, John; Salvia, Shawn Amig – Diagnostique, 1985
Performance of 100 college freshmen on the Woodcock-Johnson Psycho-Educational Battery, Part II, Tests of Achievement, were analyzed by subtests and cluster scores to determine appropriateness for assessing achievement of handicapped students. Minor inversions in item order and pronounced ceiling effects on all subtests yielded lowered subtest and…
Descriptors: Achievement Tests, Cluster Analysis, College Freshmen, Disability Identification
Peer reviewedCalderbank, Mark; Awwad, Muhammad – System, 1988
Discusses the issue of oral communication tests through an account of an oral test administered to nearly 1,000 students at Yarmouk University Language Center. The ways in which the institution attempted to overcome problems in the oral test's feasibility, validity, and reliability are presented. (Author/CB)
Descriptors: College Students, Communicative Competence (Languages), Foreign Countries, Higher Education
Peer reviewedFoster, Karen – Journal of Reading, 1987
Reviews a secondary school version of the Test of English as a Foreign Language (TOEFL) that focuses on two of the primary language skills: listening and reading. Subtest includes a variety of tasks measuring semantic, syntactic, and higher-level reading comprehension abilities. (NKA)
Descriptors: English (Second Language), Language Proficiency, Language Tests, Listening Comprehension Tests
Peer reviewedMather, Peter W.; And Others – Journal of Reading, 1985
Examines the changes made in the Nelson Reading Skills Test, Forms 3 and 4, noting that the authors of the revised test have strengthened the test by requiring students to define words in context and by adding sections related to phonic skills, word parts, and reading rate. (HOD)
Descriptors: Content Analysis, Elementary Secondary Education, Reading Comprehension, Reading Rate
Peer reviewedSalend, Spencer J. – Intervention in School and Clinic, 1995
Test modifications and techniques that teachers can employ to adapt their tests to meet individualized needs of mainstreamed students with disabilities are considered. Suggestions are offered to assist special education teachers in helping general educators design tests; address test reliability, validity, content, and format; and develop…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, Learning Problems
Peer reviewedCaudill, Steven B.; Gropper, Daniel M. – Journal of Economic Education, 1991
Presents a study of the effect of question order on student performance on economics tests. Reports that question order has no statistically significant effect on examination scores, even after including variables that reflect differential human capital characteristics. Concludes that instructors need not worry that some examination versions give…
Descriptors: Economics Education, Educational Research, Higher Education, Human Capital
Tsagari, Dina – Online Submission, 2007
The review presented and discussed in this paper explores the theoretical underpinnings and research findings of the washback of high-stakes tests in the field of language teaching and testing as well general education and suggests areas and ways of researching the phenomenon in the future. (Contains 1 figure and 1 table.)
Descriptors: Testing, Influences, Language Tests, High Stakes Tests
Alderson, J. Charles; And Others – 1995
The guide is intended for teachers who must construct language tests and for other professionals who may need to construct, evaluate, or use the results of language tests. Most examples are drawn from the field of English-as-a-Second-Language instruction in the United Kingdom, but the principles and practices described may be applied to the…
Descriptors: Educational Trends, English (Second Language), Interrater Reliability, Language Tests
Siskind, Theresa G.; And Others – 1992
The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…
Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing
Weiping, Wu – 1991
Problems in the testing of Chinese as a foreign language (CFL) are examined, focusing on proficiency testing needs and test standardization. Particular attention is paid to listening and reading assessment. The first part of the discussion looks at specific problems with five existing proficiency tests, including such aspects as inadequacy of the…
Descriptors: Chinese, Cultural Context, Language Proficiency, Language Role
Melancon, Janet G.; Thompson, Bruce – 1989
Classical measurement theory was used to investigate the measurement (psychometric) characteristics of both parts of the Finding Embedded Figures Test (FEFT) administered in either a "no guessing" supply format or a multiple-choice selection format to undergraduate college students or to middle school students. Three issues were…
Descriptors: Comparative Testing, Construct Validity, Higher Education, Junior High School Students
Vispoel, Walter P.; Twing, Jon S. – 1989
The measurement precision, efficiency, and validity of an adaptive test and four conventional listening tests designed to assess musical ability were compared. The conventional tests were the Seashore Tonal Memory Test and three tests (peaked, rectangular, and maximum discrimination) constructed from items in the 278-item adaptive test pool. The…
Descriptors: Adaptive Testing, College Students, Comparative Testing, High School Students


