Bowman, Harry L.; And Others – 1989
Memphis State University conducted a study for the Tennessee Department of Education to determine the validity of 11 previously unvalidated, extensively revised, or new Educational Testing Service (ETS) tests as measures of the skills and knowledge required for specific initial certification endorsements of public school personnel in Tennessee.…
Descriptors: College Faculty, Content Analysis, Elementary Secondary Education, Higher Education
Livingston, Samuel A.; Lewis, Charles – 1993
This paper presents a method for estimating the accuracy and consistency of classifications based on test scores. The scores can be produced by any scoring method, including the formation of a weighted composite. The estimates use data from a single form. The reliability of the score is used to estimate its effective test length in terms of…
Descriptors: Classification, Error of Measurement, Estimation (Mathematics), Reliability
Kaplan, Randy M.; Bennett, Randy Elliot – 1994
This study explores the potential for using a computer-based scoring procedure for the formulating-hypotheses (F-H) item. This item type presents a situation and asks the examinee to generate explanations for it. Each explanation is judged right or wrong, and the number of creditable explanations is summed to produce an item score. Scores were…
Descriptors: Automation, Computer Assisted Testing, Correlation, Higher Education
Porter, Eleanor; And Others – 1992
A United States Employment Service (USES) General Aptitude Test Battery (GATB) plan was formulated to evaluate and develop additional job-related assessment methods. Eighteen alternative predictors were reviewed, from which biodata was selected for development. The literature indicates that biodata provides increased validity with little or no…
Descriptors: Biographical Inventories, Educational Research, Occupational Tests, Personnel Evaluation
Bennett, Randy Elliot; And Others – 1988
This study investigated the extent of agreement between MicroPROUST, a prototype microcomputer-based expert scoring system, and human readers for two Advanced Placement Computer Science free-response items. To assess agreement, a balanced incomplete block design was used with 2 groups of 4 readers grading 43 student solutions to the first problem…
Descriptors: Advanced Placement, Computer Science, Constructed Response, Educational Technology
Thompson, Tony D.; Pommerich, Mary – 1996
Conditional item independence, also known as local independence, is necessary for the accurate estimation of item parameters within item response theory (IRT). Given that the condition of local independence will be violated to at least some degree when unidimensional models are used to represent multidimensional data, it is important to study the…
Descriptors: Achievement Tests, English, Item Response Theory, Mathematical Models
Kehoe, Jerard – 1995
This digest presents a list of recommendations for writing multiple-choice test items, based on psychometric statistics that are typically provided by a measurement, or test scoring, service where tests are machine-scored, or by testing software packages. Test makers can capitalize on the fact that "bad" items can be differentiated from…
Descriptors: Item Analysis, Item Banks, Measurement Techniques, Multiple Choice Tests
Wheeler, Patricia H. – 1995
When individuals are given tests that are too hard or too easy, the resulting scores are likely to be poor estimates of their performance. To get valid and accurate test scores that provide meaningful results, one should use functional-level testing (FLT). FLT is the practice of administering to an individual a version of a test with a difficulty…
Descriptors: Adaptive Testing, Difficulty Level, Educational Assessment, Performance
Wolf, Kenneth; And Others – 1997
An overview of teaching portfolios is presented so that principals and other school administrators can make informed choices about their use. In its most basic form, a teaching portfolio is a collection of information about a teacher's practice. It becomes a structured documentary history when it is supported by reflective writing, deliberation,…
Descriptors: Elementary Secondary Education, Evaluation Methods, Portfolio Assessment, Portfolios (Background Materials)
Taylor, Catherine S. – 1997
Three mathematics scoring methods are being used or explored in large-scale assessment programs: (1) item-by-item scoring; (2) holistic scoring; and (3) "trait" scoring. This study investigated all three methods of scoring on three mathematics performance-based assessments. Mathematics assessment tasks were selected from a pool of pilot tasks…
Descriptors: Alternative Assessment, Elementary Secondary Education, Mathematical Concepts, Mathematics Instruction
Spolsky, Bernard – 1990
A discussion of the differences between the Test of English as a Foreign Language (TOEFL), an American test battery, and the Cambridge English Examinations (Cambridge), a British battery, focuses on the different approaches to language test development embodied in the tests as the source of difficulty in translating between them for individual…
Descriptors: Comparative Analysis, Cultural Differences, English (Second Language), Foreign Countries
Shukla, P. K.; Bruno, James – 1990
An analytical technique from the field of market research called conjoint analysis was applied to a psychological measurement of student testing design preferences. Past concerns with testing design are reviewed, and a newer approach to testing is identified--the modified confidence weighted-admissible probability measurement (MCW-APM) test…
Descriptors: Attitude Measures, College Students, Demography, Higher Education
Glanz, Peter K.; Brown, R. S. – Physics Teacher, 1976 (Peer reviewed)
States that final exams can best motivate students if the exams are counted substantially toward the final course grade. Proposes a weighting system in which a performance on the final that exceeds the student's average would be weighted more heavily than a poor performance. (CP)
Descriptors: College Science, Evaluation Methods, Higher Education, Motivation
Andrews, Hans A. – Journal of Vocational Behavior, 1975 (Peer reviewed)
This study was designed to test and expand Holland's vocational development theory by utilizing more than a single high point code in classification of personality patterns of jobs. A more "refined" and/or "subtle" difference was shown in the personality-job relationships when two high point codes were used. (Author)
Descriptors: Career Choice, Career Development, Decision Making, Personality
Haberman, Shelby J.; Sinharay, Sadip; Puhan, Gautam – ETS Research Report Series, 2006
Recently, there has been an increasing level of interest in reporting subscores. This paper examines the issue of reporting subscores at an aggregate level, especially at the level of institutions that the examinees belong to. A series of statistical analyses is suggested to determine when subscores at the institutional level have any added value…
Descriptors: Scores, Statistical Analysis, Error of Measurement, Reliability


