Publication Date
| In 2026 | 5 |
| Since 2025 | 482 |
| Since 2022 (last 5 years) | 2440 |
| Since 2017 (last 10 years) | 6620 |
| Since 2007 (last 20 years) | 18024 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 2140 |
| Teachers | 1218 |
| Researchers | 1054 |
| Administrators | 486 |
| Policymakers | 456 |
| Students | 176 |
| Parents | 147 |
| Counselors | 100 |
| Community | 61 |
| Media Staff | 17 |
| Support Staff | 15 |
| More ▼ | |
Location
| Canada | 784 |
| Australia | 691 |
| United States | 582 |
| California | 569 |
| United Kingdom | 479 |
| Texas | 414 |
| Florida | 403 |
| Germany | 392 |
| New York | 378 |
| United Kingdom (England) | 369 |
| China | 361 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 17 |
| Meets WWC Standards with or without Reservations | 22 |
| Does not meet standards | 21 |
Peer reviewedCharlesworth, Rosalind; And Others – Early Education and Development, 1994
Since the late 1960s, curriculum and instruction have increasingly become "test driven." Research documents the negative effects of standardized testing on instruction, students, and teachers. At their best, tests reflect students' ability to take tests. Research supports the need for change in testing policies. Massive testing must be eliminated,…
Descriptors: Alternative Assessment, Curriculum Problems, Evaluation Methods, Group Testing
Peer reviewedCohen, Peter A.; Forde, Edward B. – Journal of Dental Education, 1992
A survey of 59 dental schools found most supportive of development of instructional technology but also found faculty perceived as unenthusiastic about or unrewarded for innovation. Testing and recordkeeping were the most common computer applications; individualized instruction and paper-and-pencil simulation are used in most schools. Support…
Descriptors: Computer Oriented Programs, Computer Uses in Education, Dental Schools, Educational Media
Peer reviewedJaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Peer reviewedReid, Jerry B. – Educational Measurement: Issues and Practice, 1991
Training judges to generate item ratings in standard setting once the reference group has been defined is discussed. It is proposed that sensitivity to the factors that determine difficulty can be improved through training. Three criteria for determining when training is sufficient are offered. (SLD)
Descriptors: Computer Assisted Instruction, Difficulty Level, Evaluators, Interrater Reliability
Peer reviewedHarvill, Leo M. – Educational Measurement: Issues and Practice, 1991
This paper discusses standard error of measurement (SEM), the amount of variation or spread in the measurement errors for a test, and gives information needed to interpret test scores using SEMs. SEMs at various score levels should be used in calculating score bands rather than a single SEM value. (SLD)
Descriptors: Definitions, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Peer reviewedNaveh-Benjamin, Moshe – Journal of Educational Psychology, 1991
To investigate whether there are 2 types of test-anxious students, those with poor study skills and those with difficulties in retrieving material, study skills training or anxiety desensitization were provided to 84 high test-anxious university students in Israel. Results support the theory of two types of test-anxious students. (SLD)
Descriptors: Cognitive Processes, College Students, Comparative Analysis, Desensitization
Peer reviewedMoffitt, Robert – Evaluation Review, 1991
Statistical methods for program evaluation with nonexperimental data are reviewed with emphasis on circumstances in which nonexperimental data are valid. Three solutions are proposed for problems of selection bias, and implications for evaluation design and data collection and analysis are discussed. (SLD)
Descriptors: Bias, Cohort Analysis, Equations (Mathematics), Estimation (Mathematics)
Bell, Richard C. – Psychological Test Bulletin, 1991
A survey of 54 teachers of undergraduate and graduate psychological testing courses illustrates the teaching of testing in Australia. Tests covered in a course vary extensively. Intelligence testing is the most commonly taught (covered in 89 percent of the courses); most time on testing is spent in first-year postgraduate courses. (SLD)
Descriptors: College Curriculum, Course Content, Foreign Countries, Graduate Study
Peer reviewedRogers, W. Todd; Bateson, David J. – Journal of Experimental Education, 1991
Thirty-six testwise and 41 test-naive high school seniors in British Columbia (Canada) were tested to determine their abilities to apply selected test wiseness principles according to a proposed model of test-taking behavior. To apply the testwiseness strategy, students first needed knowledge of the content tested and test item content. (SLD)
Descriptors: Behavior Patterns, Comparative Testing, Foreign Countries, High School Seniors
Peer reviewedTzeng, Oliver C. S.; And Others – Educational and Psychological Measurement, 1991
Measurement properties of two response formats (bipolar and unipolar ratings) in personality assessment were compared using data from 135 college students taking the Myers-Briggs Type Indicator (MBTI). Factorial validity and construct validity of the MBTI were supported. Reasons why the bipolar method is preferable are discussed. (SLD)
Descriptors: College Students, Comparative Testing, Construct Validity, Factor Analysis
Peer reviewedFisek, M. Hamit; And Others – American Journal of Sociology, 1991
Presents a theoretical formulation that integrates, within the framework of expectation states theory, theories of the emergence of power-and-prestige orders in status-heterogeneous and homogeneous task-oriented groups. Discusses a model based on this theoretic formulation. Offers a model for predicting participation rates in open interaction…
Descriptors: Discussion Groups, Expectation, Group Dynamics, Hypothesis Testing
Mason, Emanuel; Zollman, Alan – Focus on Learning Problems in Mathematics, 1992
This study explored the relationship between traditional item difficulty and cognitive complexity as measured by response time. Rural students (n=43) responded to computer-based tests of the Individualized Study by Technology General Mathematics Course developed by Alaska's Department of Education. Results indicated that mean response times were…
Descriptors: Cognitive Measurement, Cognitive Processes, Computer Assisted Instruction, Computer Assisted Testing
Peer reviewedShea, Judy A.; And Others – Evaluation and the Health Professions, 1992
Video and print formats of cardiovascular motion studies were compared for use as assessment measures of interpretive skills for 392 doctors taking a cardiovascular disease certification test. Although video studies were easier to interpret, the equivalence of both motion studies supports use of the print format in national examinations. (SLD)
Descriptors: Cardiovascular System, Comparative Testing, Graduate Medical Education, Interpretive Skills
Peer reviewedFrisbie, David A. – Journal of Educational Measurement, 1992
This guide for school administrators is written to promote careful and wise use of scores from standardized achievement tests. Authors of two sections particularly criticized in the review respond about what should be included in a primer on testing and interpreting test scores for compensatory education students. (SLD)
Descriptors: Achievement Tests, Administrator Role, Compensatory Education, Educational Assessment
Peer reviewedBenedict, Ralph H. B.; And Others – Psychological Assessment, 1992
The concurrent validities of 3 short forms of the Wechsler Adult Intelligence Scale (WAIS) were compared for their prediction of full-scale IQ for 145 male and 159 female psychiatric inpatients. Results support previous research showing better predictive accuracy for L. C. Ward's (1990) seven-subtest short form than the others. (SLD)
Descriptors: Adults, Comparative Testing, Concurrent Validity, Cost Effectiveness


