Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Comparative Analysis | 8 |
Research Reports | 8 |
Test Reliability | 8 |
Item Analysis | 6 |
Statistical Analysis | 4 |
Test Construction | 4 |
Efficiency | 2 |
Error of Measurement | 2 |
High Schools | 2 |
Mathematical Formulas | 2 |
Mathematical Models | 2 |
More ▼ |
Source
Developmental Psychology | 1 |
Author
Benson, Jeri | 2 |
Brennan, Robert L, | 1 |
Claessens, Amy | 1 |
Dowsett, Chantelle J. | 1 |
Duncan, Greg J. | 1 |
Dunivant, Noel | 1 |
Engel, Mimi | 1 |
Garrison, Wayne M. | 1 |
Lewis, Barbara | 1 |
Lockwood, Robert E. | 1 |
Rentfrow, Robert K. | 1 |
More ▼ |
Publication Type
Reports - Research | 5 |
Speeches/Meeting Papers | 5 |
Information Analyses | 1 |
Journal Articles | 1 |
Reports - Evaluative | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
McCarthy Scales of Childrens… | 1 |
What Works Clearinghouse Rating
Duncan, Greg J.; Engel, Mimi; Claessens, Amy; Dowsett, Chantelle J. – Developmental Psychology, 2014
Replications and robustness checks are key elements of the scientific method and a staple in many disciplines. However, leading journals in developmental psychology rarely include explicit replications of prior research conducted by different investigators, and few require authors to establish in their articles or online appendices that their key…
Descriptors: Replication (Evaluation), Robustness (Statistics), Developmental Psychology, Educational Research
Garrison, Wayne M.; White, Karl R. – 1979
Rasch and classical test analysis methods were compared with respect to their similarities and differences in the identification of noninformative items and implausible person records. Using computer simulated data with known parameters, each model was evaluated in terms of its effectiveness in: (1) identifying noninformative or "bad"…
Descriptors: Comparative Analysis, Item Analysis, Models, Monte Carlo Methods
Lewis, Barbara; And Others – 1973
This paper describes and evaluates a new abstract form of the Purdue Elementary Problem-Solving Inventory. The new test parallels a shortened form of the original Inventory, but presents problems verbally rather than through slides. Both forms were given to advantaged and disadvantaged second- and fourth-graders. For the total sample, the slide…
Descriptors: Abstract Reasoning, Cognitive Tests, Comparative Analysis, Elementary Education
Benson, Jeri – 1979
Two methods of item selection were used to select sets of 40 items from a 50-item verbal analogies test, and the resulting item sets were compared for relative efficiency. The BICAL program was used to select the 40 items having the best mean square fit to the one parameter logistic (Rasch) model. The LOGIST program was used to select the 40 items…
Descriptors: Comparative Analysis, Computer Programs, Costs, Efficiency
Brennan, Robert L,; Lockwood, Robert E. – 1979
Procedures for determining cutting scores have been proposed by Angoff and by Nedelsky. Nedelsky's approach requires that a rater examine each distractor within a test item to determine the probability of a minimally competent examinee answering correctly; whereas Angoff uses a judgment based on the whole item, rather than each of its components.…
Descriptors: Achievement Tests, Comparative Analysis, Cutting Scores, Guessing (Tests)
Dunivant, Noel – 1979
Eight different methods are reviewed for determining whether two or more tests are equivalent measures. These methods vary in restrictiveness from the Wilks-Votaw test of compound symmetry (which requires that all means, variances, and covariances are equal), to Joreskog's theory of congeneric tests (which requires only that the tests are measures…
Descriptors: Analysis of Variance, Comparative Analysis, Error of Measurement, Evaluation Methods
A Comparison of Three Types of Test Development Procedures Using Classical and Latent Trait Methods.
Benson, Jeri; Wilson, Michael – 1979
Three methods of item selection were used to select sets of 38 items from a 50-item verbal analogies test and the resulting item sets were compared for internal consistency, standard errors of measurement, item difficulty, biserial item-test correlations, and relative efficiency. Three groups of 1,500 cases each were used for item selection. First…
Descriptors: Comparative Analysis, Difficulty Level, Efficiency, Error of Measurement
Rentfrow, Robert K. – 1972
As part of the national Head Start Planned Variation Study, this study used a relatively small sample in an intensive evaluation of program implementation in one field community using the Tucson Early Education Model (TEEM). A modified Solomon four-group research design formed the organization framework. Evaluation of six TEEM classrooms and two…
Descriptors: Affective Behavior, Analysis of Covariance, Attitude Measures, Behavior Rating Scales