Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 3 |
Descriptor
Test Construction | 22 |
Test Items | 22 |
Test Selection | 22 |
Item Analysis | 9 |
Test Reliability | 8 |
Higher Education | 6 |
Test Use | 5 |
Test Validity | 5 |
Testing Problems | 5 |
Difficulty Level | 4 |
Item Banks | 4 |
Author
Ackermann, Richard | 1 |
Adema, Jos J. | 1 |
Baker, E. L. | 1 |
Baker, Eva L. | 1 |
Benson, Jeri | 1 |
Beuchert, A. Kent | 1 |
Boekkooi-Timminga, Ellen | 1 |
Brandenburg, Dale C. | 1 |
Brownell, Sara E. | 1 |
Cahen, Leonard S. | 1 |
Cooper, Katelyn M. | 1 |
Education Level
Elementary Secondary Education | 2 |
Higher Education | 1 |
Audience
Practitioners | 3 |
Researchers | 1 |
Teachers | 1 |
Location
Singapore | 1 |
United Kingdom | 1 |
West Virginia | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
No Child Left Behind Act 2001 | 1 |
Wright, Christian D.; Huang, Austin L.; Cooper, Katelyn M.; Brownell, Sara E. – International Journal for the Scholarship of Teaching and Learning, 2018
College instructors in the United States usually make their own decisions about how to design course exams. Even though summative course exams are well known to be important to student success, we know little about the decision making of instructors when designing course exams. To probe how instructors design exams for introductory biology, we…
Descriptors: College Faculty, Science Teachers, Science Tests, Teacher Made Tests
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
Polin, L.; Baker, E. L. – 1978
A neglected element in designing tests is that of publicness, that is, the extent to which test specifications are understandable and usable by all interested parties. Issues related to content validity, such as test bias and instructional sensitivity, become accessible to these parties once content validity and design have been adequately…
Descriptors: Rating Scales, Test Construction, Test Items, Test Selection

Beuchert, A. Kent; Mendoza, Jorge L. – Journal of Educational Measurement, 1979
Ten item discrimination indices, across a variety of item analysis situations, were compared, based on the validities of tests constructed by using each of the indices to select 40 items from a 100-item pool. Item score data were generated by a computer program and included a simulation of guessing. (Author/CTM)
Descriptors: Item Analysis, Simulation, Statistical Analysis, Test Construction
Boekkooi-Timminga, Ellen – 1988
A new test construction method based on integer linear programming is described. This method selects optimal tests in small amounts of computer time. The new method, called the Cluster-Based Method, assumes that the items in the bank have been grouped according to their item information curves so that items within a group, or cluster, are…
Descriptors: Computer Assisted Testing, Item Banks, Latent Trait Theory, Mathematical Models

Ebel, Robert L. – Journal of Educational Measurement, 1982
Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)
Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items

Adema, Jos J. – Applied Psychological Measurement, 1992
Two methods are proposed for the construction of weakly parallel tests based on a prespecified information function. A method is then described for selecting weakly parallel tests that are optimal with respect to the Maximin criterion. Numerical examples demonstrate the practicality of the tests. (SLD)
Descriptors: Equations (Mathematics), Heuristics, Item Banks, Item Response Theory

Nevo, Barukh – Educational and Psychological Measurement, 1977
Item-test correlations are compared to item test-retest correlations as measures for selecting items in test construction. The author concludes that the item test-retest method is superior for item analysis which aims at getting shorter tests while maintaining test stability. (Author/JKS)
Descriptors: College Students, Correlation, Higher Education, Item Analysis

Henderson, Metta Lou – American Journal of Pharmaceutical Education, 1984
The uses, advantages and disadvantages, preparation, and scoring of essay tests and oral tests are outlined and discussed, and sample questions of each type oriented to pharmaceutical instruction are provided. (MSE)
Descriptors: Essay Tests, Higher Education, Pharmaceutical Education, Scoring
Lutkus, Anthony D.; Laskaris, George – 1981
Analyses of student responses to Introductory Psychology test questions were discussed. The publisher supplied a two-thousand-item test bank on computer tape. Instructors selected questions for fifteen-item tests. The test questions were labeled by the publisher as factual or conceptual. The semester course used a mastery learning format in which…
Descriptors: Difficulty Level, Higher Education, Item Analysis, Item Banks

Dolinsky, Donna; Reid, Vincent E. – American Journal of Pharmaceutical Education, 1984
Cognitive learning and cognitive measures are defined and various types of objective measures of cognitive learning are discussed and compared, including short answer test items, true-false items, multiple choice items, matching items, and written simulations. (MSE)
Descriptors: Cognitive Tests, Comparative Analysis, Higher Education, Measurement Techniques

Goh, David S. – Applied Psychological Measurement, 1979
The advantages of using psychometric theory to design short forms of intelligence tests are demonstrated by comparing such usage to a systematic random procedure that has previously been used. The Wechsler Intelligence Scale for Children Revised (WISC-R) Short Form is presented as an example. (JKS)
Descriptors: Elementary Secondary Education, Intelligence Tests, Item Analysis, Psychometrics

Shick, Jacqueline – Science Teacher, 1990
Described is how tests that accompany texts can be effective if items selected relate to topics stressed in class. Identified are problems commonly found in these types of tests and directions on how to improve problem questions. (KR)
Descriptors: Science Education, Science Materials, Science Tests, Secondary Education
ERIC Clearinghouse for Social Studies/Social Science Education, Bloomington, IN. – 1987
Appropriate evaluation can greatly enhance the teaching process, and this resource packet is designed to help make testing more efficient. Tests and test items are featured in these listings, and information on test construction is provided. The various sources which are highlighted include: (1) professional organizations; (2) journals and…
Descriptors: Annotated Bibliographies, Courseware, Elementary Secondary Education, Evaluation Criteria