Bennett, Randy Elliot; And Others – Journal of Educational Measurement, 1991
The relationship of multiple-choice and free-response items on the College Board's Advanced Placement Computer Science Examination was studied using confirmatory factor analysis. Results with 2 samples of 1,000 high school students suggested that the most parsimonious fit was achieved using a single factor. Implications for construct validity are…
Descriptors: Chi Square, College Entrance Examinations, Comparative Testing, Computer Science

Skaggs, Gary; Lissitz, Robert W. – Journal of Educational Measurement, 1992
The consistency of several item bias detection methods was studied across different test administrations of the same items using data from a mathematics test given to approximately 6,600 eighth-grade students in all. The Mantel-Haenszel and item-response-theory-based sum-of-squares methods were the most consistent. (SLD)
Descriptors: Comparative Testing, Grade 8, Item Bias, Item Response Theory

Birenbaum, Menucha; And Others – Applied Psychological Measurement, 1992
The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…
Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis

Harris, Abigail M.; Carlton, Sydell T. – Applied Measurement in Education, 1993
Differential item functioning on 6 forms of the Scholastic Aptitude Test was examined for 181,228 male and 198,668 female students focusing on the points tested, the test format, and subject matter in which items are embedded. Implications of the identifiable differences are discussed. (SLD)
Descriptors: College Entrance Examinations, Comparative Analysis, Females, High School Students

DeMars, Christine E. – Applied Measurement in Education, 2000
Studied the effects of test consequences, response formats, gender, and ethnicity on the mathematics and science sections of the Michigan High School Proficiency Test. Results for more than 11,000 students show that students taking constructed-response and multiple-choice formats performed better under high-stakes conditions. Discusses gender and…
Descriptors: Constructed Response, Ethnicity, High School Students, High Schools
Alderson, J. Charles; And Others – 1995
The guide is intended for teachers who must construct language tests and for other professionals who may need to construct, evaluate, or use the results of language tests. Most examples are drawn from the field of English-as-a-Second-Language instruction in the United Kingdom, but the principles and practices described may be applied to the…
Descriptors: Educational Trends, English (Second Language), Interrater Reliability, Language Tests
Smith, Robert L.; Carlson, Alfred B. – 1995
The feasibility of constructing test forms with practically equivalent cut scores using judges' estimates of item difficulty as target "statistical" specifications was investigated. Test forms with equivalent judgmental cut scores (based on judgments of item difficulty) were assembled using items from six operational forms of the…
Descriptors: Cutting Scores, Decision Making, Difficulty Level, Equated Scores
Myerberg, N. James – 1996
The Montgomery County (Maryland) public school system has started using assessments other than multiple-choice tests because it is felt that this will provide school staff with better information about the success of the instructional program. One of the ways assessments can provide better information is by having teachers score student papers.…
Descriptors: Accountability, Achievement Tests, Educational Assessment, Elementary Secondary Education
Donoghue, John R.; Mazzeo, John – 1995
At grades 8 and 12, the 1992 National Assessment of Educational Progress (NAEP) reading assessment contained a small number of 50-minute blocks in addition to the usual 25-minute blocks. To determine whether to incorporate the 50-minute blocks into the operational scaling, this study sought to determine whether the longer blocks measured a…
Descriptors: Chi Square, Goodness of Fit, Grade 12, Grade 8
Bennett, Randy Elliot – 1994
The Educational Testing Service is moving rapidly to computerize its tests for admissions to postsecondary education and occupational licensure/certification. Computerized tests offer important advantages, including immediate score reporting, the convenience of testing when the examinee wishes, and for adaptive tests, equal accuracy throughout the…
Descriptors: Adaptive Testing, College Entrance Examinations, Computer Assisted Testing, Computer Managed Instruction
Siskind, Theresa G.; And Others – 1992
The instructional validity of computer-administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…
Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing
Meisels, Samuel J.; Marsden, Dorothea B.; Wiske, Martha Stone; Henderson, Laura W. – 1997
The Early Screening Inventory-Revised (ESI-R) is a brief developmental screening instrument that is individually administered to children from 3 to 6 years of age. It is designed to identify children who may need special education services in order to perform successfully in school. The ESI-R is intended to assess the child's ability to acquire…
Descriptors: Child Development, Cognitive Development, Disabilities, Language Acquisition
Weiping, Wu – 1991
Problems in the testing of Chinese as a foreign language (CFL) are examined, focusing on proficiency testing needs and test standardization. Particular attention is paid to listening and reading assessment. The first part of the discussion looks at specific problems with five existing proficiency tests, including such aspects as inadequacy of the…
Descriptors: Chinese, Cultural Context, Language Proficiency, Language Role
Wang, Chen-Shih; Ackerman, Terry – 1994
Passages used in the Illinois Goal Assessment Program (IGAP) reading test are intact pieces of literature, stories, and essays that match classroom reading assignments and typical student reading experiences. There are 15 testlets, each containing 5 items, associated with each passage. Each testlet requires students to demonstrate various levels…
Descriptors: Analysis of Covariance, Elementary Education, Elementary School Students, Grade 3
Gullickson, Arlen R. – 1982
Rudman and colleagues (1980) deplored the paucity of descriptive information relative to teachers' test use patterns. The present study addresses the abundance of prescriptive information, and the lack of descriptive information, concerning teacher testing. A mailed survey procedure gathered testing practice information from elementary and secondary South Dakota…
Descriptors: Elementary School Teachers, Elementary Secondary Education, Secondary School Teachers, Teacher Education