Publication Date
| In 2026 | 3 |
| Since 2025 | 477 |
| Since 2022 (last 5 years) | 2435 |
| Since 2017 (last 10 years) | 6615 |
| Since 2007 (last 20 years) | 18019 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 2140 |
| Teachers | 1218 |
| Researchers | 1054 |
| Administrators | 486 |
| Policymakers | 456 |
| Students | 176 |
| Parents | 147 |
| Counselors | 100 |
| Community | 61 |
| Media Staff | 17 |
| Support Staff | 15 |
| More ▼ | |
Location
| Canada | 784 |
| Australia | 691 |
| United States | 582 |
| California | 569 |
| United Kingdom | 479 |
| Texas | 414 |
| Florida | 403 |
| Germany | 392 |
| New York | 378 |
| United Kingdom (England) | 369 |
| China | 361 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 17 |
| Meets WWC Standards with or without Reservations | 22 |
| Does not meet standards | 21 |
Peer reviewedRoss, Donald C. – Educational and Psychological Measurement, 1977
The kappa coefficient and the chi square statistic are used as indices of agreement between two judges' ratings of a set of stimuli on a nominal scale. In this article, the logic of these indices is extended to weighted multi-way cases. (Author/JKS)
Descriptors: Hypothesis Testing, Nouns, Weighted Scores
Peer reviewedWillson, Victor L. – Educational and Psychological Measurement, 1976
It is shown that the rank-biserial correlation coefficient is a linear function of the U-statistic (Mann and Whitney), so that a test of group mean difference is equivalent to a test of zero correlation for the rank-biserial coefficient. (RC)
Descriptors: Correlation, Hypothesis Testing, Statistical Significance
Peer reviewedAgresti, Alan; Wackerly, Dennis – Psychometrika, 1977
Exact conditional tests of independence in cross-classification tables are formulated based on chi square and other statistics with stronger operational interpretations, such as some nominal and ordinal measures of association. Guidelines for table dimensions and sample sizes for which the tests are economically implemented on a computer are…
Descriptors: Expectancy Tables, Hypothesis Testing, Sampling
Peer reviewedStrohmer, Douglas C.; Chiodo, Anthony L. – Journal of Counseling Psychology, 1984
Presents two experiments concerning confirmatory bias in the way counselors collect data to test their hypotheses. Counselors were asked either to develop their own clinical hypothesis or were given a hypothesis to test. Confirmatory bias in hypothesis testing was not supported in either experiment. (JAC)
Descriptors: Counseling Techniques, Counselors, Hypothesis Testing
Peer reviewedCentra, John A. – Journal of Learning Disabilities, 1986
Handicapped students' scores on timed and untimed editions of the Scholastic Aptitude Test (SAT) were studied. Of the approximately 1800 students studied, 79 percent were learning disabled. Handicapped students' performance improved with extended time, the increase being greater than that for nonhandicapped students tested with extra time.…
Descriptors: Disabilities, Testing, Time Factors (Learning)
Peer reviewedFagley, N. S. – Special Services in the Schools, 1984
Behavioral assessment is seen as a way of individualizing programing for special needs pupils by obtaining information (through such techniques as self-monitoring, direct observation, and permanent product review) and evaluating information (social validation and single-case research designs). (CL)
Descriptors: Disabilities, Elementary Secondary Education, Testing
Eaves, Ronald C.; Harwood, Pamela Lammonds – Diagnostique, 1984
It is argued that the manual of the Test of Adolescent Language lacks precision even though the authors explicitly state that the test may be used for the purpose of identifying intraindividual differences. To alleviate this problem, a table containing statistically significant differences between subtest standard scores is presented. (Author/CL)
Descriptors: Adolescents, Language Handicaps, Testing Problems
Peer reviewedJordan, Linda S.; Hall, Penelope K. – Language, Speech, and Hearing Services in Schools, 1985
Performance of 286 normal children (grades K-9) on the De Renzi and Faglioni form of the Token Test and the De Renzi and Ferrari Reporter's Test were analyzed. Two different scoring conventions were compared: number correct versus weighted scores. Normative data are presented by grade level and age. Specific administration and scoring procedures…
Descriptors: Elementary Education, Language Tests, Testing
Peer reviewedParks, Brian T.; And Others – Journal of Marital and Family Therapy, 1985
Investigated the validity of a microcomputer-administered version of the Marital Adjustment Test (MAT) (N=100). Results showed no significant differences in the computer and the paper-pencil versions, regardless of order of presentation. (JAC)
Descriptors: Computer Assisted Testing, Marital Satisfaction
Peer reviewedNaglieri, Jack A. – American Journal of Mental Deficiency, 1985
A comparison of the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Kaufman Assessment Battery for Children (K-ABC) with 37 mentally retarded children revealed that the WISC-R Full Scale IQ resulted in scores significantly lower than the K-ABC Mental Processing Composite. (CL)
Descriptors: Intelligence Tests, Mental Retardation, Testing
Peer reviewedChatman, Steven P.; And Others – Journal of Learning Disabilities, 1984
Discrepancy scores for four global scales and the Nonverbal Scale were examined for the Kaufman Assessment Battery for Children using the standardization sample as the data source. The range of subtest scores and the profile variability of subtest scores were determined for each sex, race, and age level. (Author/CL)
Descriptors: Disabilities, Elementary Education, Scoring, Testing
Peer reviewedCooper, Martin – Educational and Psychological Measurement, 1976
An exact probability test for use with certain Likert-type scales is presented. The procedure assumes equally-spaced points, independence of subjects' responses, and that each point has an equal likelihood of response for each subject. Tables for critical values are presented. (Author/JKS)
Descriptors: Hypothesis Testing, Probability, Rating Scales
Peer reviewedLevy, Kenneth J. – Educational and Psychological Measurement, 1976
A procedure is specified for testing the significance of predicted trends in k independent correlations. An example is also provided for illustrative purposes. (Author)
Descriptors: Correlation, Hypothesis Testing, Trend Analysis
Luecht, Richard M.; Burgin, William – 2003
Adaptive multistage testlet (MST) designs appear to be gaining popularity for many large-scale computer-based testing programs. These adaptive MST designs use a modularized configuration of preconstructed testlets and embedded score-routing schemes to prepackage different forms of an adaptive test. The conditional information targeting (CIT)…
Descriptors: Adaptive Testing, Simulation, Test Construction
Peer reviewedBrown, J. Cooper – Psychological Reports, 1973
Effects of two methods of test-taking behavior on performances were compared. Further study is in order of the hypothesis that students who take exams in small groups working for a common grade will perform better then when taking exams individually. (Author/JB)
Descriptors: Evaluation, Evaluation Methods, Performance, Testing


