Publication Date
| In 2026 | 6 |
| Since 2025 | 444 |
| Since 2022 (last 5 years) | 1942 |
| Since 2017 (last 10 years) | 4086 |
| Since 2007 (last 20 years) | 6792 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 644 |
| Teachers | 455 |
| Researchers | 440 |
| Administrators | 126 |
| Policymakers | 68 |
| Students | 68 |
| Counselors | 26 |
| Parents | 24 |
| Community | 10 |
| Support Staff | 5 |
| Media Staff | 3 |
| More ▼ | |
Location
| Turkey | 608 |
| Australia | 341 |
| Canada | 254 |
| China | 180 |
| Indonesia | 149 |
| United States | 143 |
| United Kingdom | 130 |
| Germany | 118 |
| Taiwan | 111 |
| California | 110 |
| Spain | 107 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 2 |
Peer reviewedMeredith, Mimi – Youth Theatre Journal, 1989
Explains the development of Florida's certification examination for prospective drama teachers. Describes its process design and the participation of drama educators from around the state as keys to the test's successful development. (SR)
Descriptors: Drama, Higher Education, Secondary Education, Teacher Certification
Peer reviewedLaugksch, Rudiger C.; Spargo, Peter E. – Science Education, 1996
Describes development of 472 true-false scientific literacy test items with high content and item validity that test 240 key ideas in, and attitudes toward, science. Items are based on selected chapters of the 1989 American Association for the Advancement of Science report, entitled Science for All Americans. Appendix contains sample items.…
Descriptors: Elementary Secondary Education, Science Tests, Scientific Literacy, Test Construction
Peer reviewedKrus, David J.; Helmstadter, Gerald C. – Educational and Psychological Measurement, 1993
Negative coefficients of reliability, sometimes returned by the standard formula for estimation of the internal-consistency reliability, are neither theoretically nor numerically correct. Alternative strategies for test development in this special case are suggested. (Author)
Descriptors: Estimation (Mathematics), Reliability, Test Construction, Test Use
Peer reviewedOsipow, Samuel H. – Journal of Counseling and Development, 1991
Describes author's experiences and roles in developing four career-oriented measures: translating the Ramak into English; development of the Career Decision Scale and Occupational Stress Inventory; and current development of the Task Specific Scale of Occupational Self-Efficacy. (Author/ABL)
Descriptors: Career Counseling, Counseling, Evaluation Methods, Measures (Individuals)
Peer reviewedKlecker, Beverly M.; Loadman, William E. – Educational and Psychological Measurement, 1998
The stability, reliability, and validity of scores on the subscales of the School Participant Empowerment Scale (P. Short and J. Rinehart, 1992) were studied with data from 4,091 Ohio classroom teachers. Confirmatory factor analysis did not confirm the subscales identified by the instrument developers. Explanatory factor analysis was used to…
Descriptors: Empowerment, Participative Decision Making, Reliability, Teachers
Peer reviewedStuart-Hamilton, Ian – Educational Gerontology, 1999
Attitudes were assessed after 89 undergraduates were asked either five neutral questions, five questions on the economic welfare of older people, or five on elders' physical frailty. Economic questions resulted in significantly more negative views of the mental aspects of aging, suggesting that questionnaires may contain tacit sources of bias. (SK)
Descriptors: Age Discrimination, Aging (Individuals), Attitudes, Test Bias
Peer reviewedEngelhard, George, Jr.; Davis, Melodee; Hansche, Linda – Applied Measurement in Education, 1999
Examined whether reviewers on item-review committees can identify accurately test items that exhibit a variety of flaws. Results with 39 reviewers of a 75-item test show that reviewers exhibit fairly high accuracy rates overall, with statistically significant differences in judgmental accuracy among reviewers. (SLD)
Descriptors: Decision Making, Judges, Review (Reexamination), Test Construction
Peer reviewedWadden, Paul; Hilke, Robert; Hamp-Lyons, Liz – TESOL Quarterly, 1999
Provides a form of argumentative dialectic to Liz Hamp-Lyons's forum commentary published in an earlier issue of this journal, "Ethical Test Preparation Practice: The Case of TOEFL." Hamp-Lyons responds to the comments.(Author/VWL)
Descriptors: English (Second Language), Ethics, Second Language Instruction, Test Construction
Peer reviewedSanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998
Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)
Descriptors: Algorithms, Models, Reliability, Test Construction
Peer reviewedTimminga, Ellen – Applied Psychological Measurement, 1998
Discusses problems of diagnosing and repairing infeasible linear-programming models in computerized test assembly. Demonstrates that it is possible to localize the causes of infeasibility, although this is not always easy. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Linear Programming, Test Construction
Peer reviewedvan der Linden, Wim J.; Adema, Jos J. – Journal of Educational Measurement, 1998
Proposes an algorithm for the assembly of multiple test forms in which the multiple-form problem is reduced to a series of computationally less intensive two-form problems. Illustrates how the method can be implemented using 0-1 linear programming and gives two examples. (SLD)
Descriptors: Algorithms, Linear Programming, Test Construction, Test Format
Peer reviewedMeier, Scott T. – Measurement and Evaluation in Counseling and Development, 1998
Traditional Item Selection Rules (TISRs) were compared to Intervention Item Selection Rules (IISRs) on the same set of alcohol attitude items from archival data, producing scales with differing psychometric properties. A cross-validation study was run. Guidelines for selecting change-sensitive items, problems and advantages of IISRs are…
Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Statistical Analysis
Peer reviewedWilson, Mark; Sloane, Kathryn – Applied Measurement in Education, 2000
Describes the principles that guided the creation and implementation of a system of embedded assessments, the Berkeley Evaluation and Assessment Research System (BEAR). The assessment system builds on methodological advances in alternative assessment. Discusses how the application of the principles generates the component parts of the system. (SLD)
Descriptors: Educational Practices, Evaluation Methods, Research, Student Evaluation
Peer reviewedCarlstedt, Berit; Gustafsson, Jan-Eric; Ullstadius, Eva – Intelligence, 2000
Studied whether a change of test item sequencing, intended to increase test complexity, would cause increased involvement of general intelligence using a sample of Swedish military recruits who received heterogeneous (n=1,778) or homogeneous (n=363) tests. Items presented homogeneously showed higher general intelligence ("G") loadings.…
Descriptors: Foreign Countries, Intelligence, Military Personnel, Test Construction
Peer reviewedWright, Benjamin D.; Stenner, A. Jackson – Popular Measurement, 1999
Discusses the use of "Lexile" units in test construction. (SLD)
Descriptors: Measurement Techniques, Reading Achievement, Scaling, Student Evaluation


