NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 976 to 990 of 4,036 results Save | Export
Nix, Thomas W.; Barnette, J. Jackson – Research in the Schools, 1998
Reviews null hypothesis statistical significance testing (NHST) in its historical context and concludes that workable alternatives to NHST are available. Among suggested alternatives, effect magnitude measures, replication techniques, and meta-analytic techniques are discussed. (SLD)
Descriptors: Educational Research, Effect Size, Hypothesis Testing, Meta Analysis
Nix, Thomas W.; Barnette, J. Jackson – Research in the Schools, 1998
Attempts to clarify the positions of T. Nix and J. Barnette on statistical significance testing, advocates the routine use of effect size, and encourages reporting results in simple terms. (SLD)
Descriptors: Educational Research, Effect Size, Hypothesis Testing, Research Methodology
Ernest, James M.; McLean, James E. – Research in the Schools, 1998
Discusses commonalities in the papers of this special issue, addresses concerns about errors of omission expressed by T. Knapp, and provides some recommendations for the use of statistical significance testing with requirements for estimates of effect sizes. (SLD)
Descriptors: Educational Research, Effect Size, Estimation (Mathematics), Hypothesis Testing
Peer reviewed Peer reviewed
Sternberg, Robert J. – Educational Researcher, 1998
Links the literatures on human abilities and expertise, suggesting that human abilities are a form of developing expertise. Discusses the role of tests in a scheme that regards abilities as developing expertise and presents a model that implies a shift toward practice grounded in the development of knowledge-based expertise in all children.…
Descriptors: Ability, Children, Educational Assessment, Elementary Secondary Education
Peer reviewed Peer reviewed
Phelps, Richard P. – Educational Measurement: Issues and Practice, 2000
Compiled information from 31 countries to study trends in large-scale testing. Shows a clear trend toward adding, not dropping, testing programs. Twenty-seven countries show a net increase in testing, while only three show a decrease. Fifty-nine testing programs have been added; only four have been dropped. (SLD)
Descriptors: Educational Trends, Foreign Countries, International Education, International Studies
Peer reviewed Peer reviewed
MacKay, Gilbert; Lundie, Jennifer – International Journal of Disability, Development and Education, 1998
Recognizes the attraction of Goal Attainment Scaling (GAS), a technique that uses a scale to measure client's achievement, but suggests that there are concerns about the calculation of its standard scores. Examples show how GAS may be used in service development, whether or not numerical values are attached. (Author/CR)
Descriptors: Achievement Gains, Achievement Rating, Adults, Children
Peer reviewed Peer reviewed
Graham, John R.; Ben-Porath, Yossef S.; McNulty, John L. – Psychological Assessment, 1997
The meaning of low scores on some Minnesota Multiphasic Personality Inventory-2 (MMPI-2) scales was examined by comparing therapists' descriptors of 669 mental health patients with high, normal, or low scores on each scale. Results show that for most scales both high and low scores provide potentially important information. (SLD)
Descriptors: Correlation, Mental Disorders, Patients, Personality Assessment
Peer reviewed Peer reviewed
Posavac, E. J. – Evaluation and Program Planning, 1998
Misuses of null hypothesis significance testing are reviewed and alternative approaches are suggested for carrying out and reporting statistical tests that might be useful to program evaluators. Several themes, including the importance of respecting the magnitude of Type II errors and describing effect sizes in units stakeholders can understand,…
Descriptors: Effect Size, Evaluation Methods, Hypothesis Testing, Program Evaluation
Peer reviewed Peer reviewed
Shavelson, Richard J.; Solano-Flores, Guillermo; Ruiz-Primo, Maria Araceli – Evaluation and Program Planning, 1998
Research on developing technology for large-scale performance assessments in science is reported briefly, and a conceptual framework is presented for defining, generating, and evaluating science performance assessments. Types of tasks are discussed, and the technical qualities of performance assessments are discussed in the context of…
Descriptors: Educational Technology, Generalizability Theory, Models, Performance Based Assessment
Peer reviewed Peer reviewed
Bartley, Anthony W. – Evaluation and Program Planning, 1998
Outlines each of the papers presented in this special section, describes difficulties the arguments posed, and raises questions that might be put to the author of each of these discussions of new assessment methods in mathematics. Implications for the technology for the development of performance assessments are discussed. (SLD)
Descriptors: Educational Technology, Mathematics Tests, Science Education, Science Tests
Peer reviewed Peer reviewed
Prieto, Luis; Roset, Montse; Badia, Xavier – Journal of Applied Measurement, 2001
Tested the metric properties of a Spanish version of the Assessment of Growth Hormone Deficiency in Adults (AGHDA) questionnaire through Rasch analysis with a sample of 356 adult patients in Spain. Results suggest that the Spanish AGHDA could be a useful complement of the clinical evaluation of growth hormone deficiency patients at group and…
Descriptors: Adults, Evaluation Methods, Foreign Countries, Individual Development
Olsen, Laurie – Leadership, 2001
Good reforms can have harmful results if equity effects are ignored. As California implements its accountability system, certain questions must be addressed concerning the system's data use, measurement features (consistency, meaningfulness, achievement growth, achievement gaps among groups), instructional improvement focus, incentives for…
Descriptors: Academic Achievement, Accountability, Data Collection, Educational Change
Peer reviewed Peer reviewed
Cashel, Mary Louise; And Others – Assessment, 1995
The use of scales on the Personality Assessment Inventory (PAI) to detect defensiveness in criminal and nonclinical samples was evaluated with 45 male inmates and 38 male undergraduates under standard conditions or under instructions to feign a positive role. Results indicate that the PAI is susceptible to defensive dissimulation. (SLD)
Descriptors: Criminals, Higher Education, Identification, Multivariate Analysis
Peer reviewed Peer reviewed
Reise, Steven P.; Flannery, Wm. Peter – Applied Measurement in Education, 1996
Statistical and theoretical issues that arise from assessing person-fit on measures of typical performance are discussed, including the frequent attenuation of detection of person-misfit, the need for methods of identifying sources of response aberrancy, and person-fit measures as moderators of trait-criterion relations. (SLD)
Descriptors: Item Response Theory, Measurement Techniques, Performance, Responses
Peer reviewed Peer reviewed
Raven, John – Cognitive Psychology, 2000
Summarizes data related to the stability and variation in the norms for the Raven's Progressive Matrices Test (J. Raven, 1936), a measure of basic cognitive functioning, for different cultural, ethnic, and socioeconomic groups worldwide and within countries. Also considers variation over time and suggests an explanation for the variation in norms…
Descriptors: Change, Cognitive Tests, Ethnicity, Foreign Countries
Pages: 1  |  ...  |  62  |  63  |  64  |  65  |  66  |  67  |  68  |  69  |  70  |  ...  |  270