NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 7 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Schochet, Peter Z. – Evaluation Review, 2009
In social policy evaluations, the multiple testing problem occurs due to the many hypothesis tests that are typically conducted across multiple outcomes and subgroups, which can lead to spurious impact findings. This article discusses a framework for addressing this problem that balances Types I and II errors. The framework involves specifying…
Descriptors: Policy, Evaluation, Testing Problems, Hypothesis Testing
Peer reviewed Peer reviewed
Alderman, Donald L. – Educational and Psychological Measurement, 1981
Student self-selection in deciding to repeat a test was examined by contrasting the test performance of students taking the Scholastic Aptitude Test (SAT) as juniors and again as seniors with the test performance of students taking the SAT only once as juniors. Results suggest there is self-selection in test repetition. (Author/GK)
Descriptors: College Entrance Examinations, Comparative Analysis, Error of Measurement, Scores
Alderman, Donald L. – 1981
The test performance of students who took the Scholastic Aptitude Test (SAT) only once as juniors was contrasted with students who took the test as juniors and again as seniors. Estimates of expected test performance on a common initial administration in the junior year were derived from separate equating sections and background variables.…
Descriptors: Comparative Analysis, Error of Measurement, High School Students, High Schools
Wilcox, Rand R. – 1978
Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…
Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models
Peer reviewed Peer reviewed
Direct linkDirect link
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Lance, Charles E.; Moomaw, Michael E. – 1983
Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…
Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria
Peer reviewed Peer reviewed
Direct linkDirect link
Lei, Pui-Wa; Koehly, Laura M. – Journal of Experimental Education, 2003
Classification studies are important for practitioners who need to identify individuals for specialized treatment or intervention. When interventions are irreversible or misclassifications are costly, information about the proficiency of different classification procedures becomes invaluable. This study furnishes information about the relative…
Descriptors: Monte Carlo Methods, Classification, Discriminant Analysis, Regression (Statistics)