Publication Date
In 2025 | 3 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 41 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 395 |
Descriptor
Test Theory | 1161 |
Test Items | 261 |
Test Reliability | 252 |
Test Construction | 245 |
Test Validity | 245 |
Psychometrics | 181 |
Scores | 176 |
Item Response Theory | 165 |
Foreign Countries | 159 |
Item Analysis | 141 |
Statistical Analysis | 134 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
United States | 17 |
United Kingdom (England) | 15 |
Canada | 14 |
Australia | 13 |
Turkey | 12 |
Sweden | 8 |
United Kingdom | 8 |
Netherlands | 7 |
Texas | 7 |
New York | 6 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Elementary and Secondary… | 3 |
Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Clarke, Sophie; Lindsay, Katharine; McKenna, Chris; New, Steve – ALT-J: Research in Learning Technology, 2004
There has been a wealth of investigation into the use of online multiple-choice questions as a means of summative assessment, however the research into the use of formative MCQs by the same mode of delivery still remains patchy. Similarly, research and implementation has been largely concentrated within the Sciences and Medicine rather than the…
Descriptors: Summative Evaluation, Computer Assisted Testing, Online Systems, Multiple Choice Tests
Gullickson, Arlen R. – 1982
Rudman and colleagues (1980) deplored the paucity of descriptive information relative to teachers' test use patterns. The present study addresses the abundant prescriptive, and lack of descriptive information concerning teacher testing. A mailed survey procedure gathered testing practice information from elementary and secondary South Dakota…
Descriptors: Elementary School Teachers, Elementary Secondary Education, Secondary School Teachers, Teacher Education
Boldt, Robert F. – 1986
This study investigated whether or not the validity of the Scholastic Aptitude Test (SAT) is both higher and less variable across colleges than it appears to be. Data from 99 validity studies conducted by the Validity Study Service of the College Board were used. In addition to test validities based on first-year college averages, which were…
Descriptors: Class Rank, College Admission, College Entrance Examinations, Generalization
Eignor, Daniel R.; Stocking, Martha L. – 1986
A previous study of pre-equating the Scholastic Aptitude Test (SAT) using item response theory provided unacceptable equating results for SAT-mathematical data. The purpose of this study was to investigate two possible explanations for these unacceptable pre-equating results. Specifically, the calibration process, which made use of the…
Descriptors: College Entrance Examinations, Equated Scores, Higher Education, Latent Trait Theory
Stevenson, Zollie, Jr. – 1985
A study examined the correlations and predictive relationships between reading and language achievement test scores and North Carolina Annual Writing Assessment scores. Subjects, over 1,000 sixth and ninth grade students, were administered both the North Carolina Annual Writing Assessment and the California Achievement Test (CAT) in 1984 and 1985.…
Descriptors: Achievement Tests, Elementary Secondary Education, Language Skills, Language Tests
Carlman, Nancy – 1985
A study examined whether Canadian twelfth grade students' papers would rate differently when they were written in different modes and whether there are significant differences between global (modified holistic) scores and rhetorical effectiveness (modified primary trait) scores for the same papers. Fifty students wrote on two transactional topics…
Descriptors: Comparative Analysis, Discourse Modes, Evaluation Methods, Foreign Countries
Thathong, Ngamnit; Kruawan, Preecha – 1985
The feasibility of a self-scoring flexilevel test was investigated in terms of practical effectiveness and criterion related validity. The test was administered to over 2,000 students studying secondary school mathematics in Thailand. The study was conducted, and the test administered, in two phases: test development and evaluation. The test was…
Descriptors: Adaptive Testing, Feasibility Studies, Foreign Countries, High Schools
Alberta Dept. of Education, Edmonton. Student Evaluation and Data Processing Branch. – 1987
The purpose of this booklet is to provide social studies teachers with student writing samples that exemplify the criteria used to score students' responses on the June 1987 Alberta (Canada) Social Studies 30 Diploma Examination. Students choose one of two possible essay questions that required writing a defense of individual initiative in…
Descriptors: Comparative Testing, Constructed Response, Educational Testing, Essay Tests
Webb, Noreen; Herman, Joan – 1984
This paper describes the development of a language arts test to assess the consistency of student response patterns and the feasibility of using the test to diagnose students' misconceptions. The studies were part of a project to develop computerized adaptive testing for the language arts with software to diagnose student errors. The…
Descriptors: Adaptive Testing, Computer Assisted Testing, Diagnostic Tests, Error Patterns
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests administered in grades 3-8 of reading, mathematics, and science items, as well as standardized test results were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
Armour-Thomas, Eleanor – 1986
The use of standardized tests and test data to detect and address differences in cognitive styles is advocated here. To this end, the paper describes the componential theory of intelligence addressed by Sternberg et. al. This theory defines the components of intelligence by function and level of generality, including: (1) metacomponents: higher…
Descriptors: Cognitive Processes, Cognitive Style, Cognitive Tests, Diagnostic Tests
Schaeffer, Gary A.; And Others – 1984
The reliability of criterion referenced tests, which are often used to evaluate health education programs, may be conceptualized in different ways. Classical conceptualizations of test reliability have limited usefulness when applied to health-related criterion referenced tests. When a cutting score is set, test reliability can be represented as…
Descriptors: Correlation, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
Paludi, Michele A. – 1981
The fear of success (FOS) construct in the achievement motivation of women was initially written by undergraduates in which the opening sentence described a male or female who ranked first in a medical school class. From these results, an intrapsychic interpretation of FOS was made. Other researchers, who accepted the sex of the cue character as…
Descriptors: Achievement Need, Fear of Success, Females, Motivation
Haladyna, Tom; Roid, Gale – 1980
An empirical review of test items is described as an essential step in criterion-referenced test development. The concept of test items' instructional sensitivity is introduced, and research is briefly reviewed which describes four theoretical contexts in which instructional sensitivity indexes have been observed: criterion-referenced; classical…
Descriptors: Achievement Tests, Bayesian Statistics, Course Objectives, Criterion Referenced Tests
Psychological Corp., New York, NY. – 1978
Seven criteria for selecting and describing the standardization sample of the Metropolitan Achievement Test (MAT), 1978 edition are discussed. Four major variables were used to describe the sample: socioeconomic status; school district enrollment; public vs. nonpublic; and geographic region. Both forms and all grade levels of the Survey Battery…
Descriptors: Achievement Tests, Elementary Secondary Education, Enrollment, Geographic Regions