ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Comparative Analysis	7
Error of Measurement	7
Testing Problems	7
Scores	3
Statistical Analysis	3
Item Analysis	2
Psychometrics	2
Research Methodology	2
Test Reliability	2
Achievement Tests	1
Behavior Rating Scales	1
Business Administration	1
Classification	1
College Entrance Examinations	1
Cross Cultural Studies	1
Cultural Context	1
Cultural Differences	1
Discriminant Analysis	1
Evaluation	1
Evaluation Criteria	1
Evaluation Methods	1
Evaluation Research	1
Factor Analysis	1
Global Approach	1
High School Students	1
More ▼

Source

Educational and Psychological…	1
Evaluation Review	1
International Journal of…	1
Journal of Experimental…	1

Author

Alderman, Donald L.	2
Foster, Jeff L.	1
Koehly, Laura M.	1
Lance, Charles E.	1
Lei, Pui-Wa	1
Meyer, Kevin D.	1
Moomaw, Michael E.	1
Schochet, Peter Z.	1
Wilcox, Rand R.	1

Publication Type

Reports - Research	5
Journal Articles	4
Reports - Evaluative	2
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)

What Works Clearinghouse Rating

Showing all 7 results Save | Export

An Approach for Addressing the Multiple Testing Problem in Social Policy Impact Evaluations

Peer reviewed

Direct link

Schochet, Peter Z. – Evaluation Review, 2009

In social policy evaluations, the multiple testing problem occurs due to the many hypothesis tests that are typically conducted across multiple outcomes and subgroups, which can lead to spurious impact findings. This article discusses a framework for addressing this problem that balances Types I and II errors. The framework involves specifying…

Descriptors: Policy, Evaluation, Testing Problems, Hypothesis Testing

Student Self-Selection and Test Repetition.

Peer reviewed

Alderman, Donald L. – Educational and Psychological Measurement, 1981

Student self-selection in deciding to repeat a test was examined by contrasting the test performance of students taking the Scholastic Aptitude Test (SAT) as juniors and again as seniors with the test performance of students taking the SAT only once as juniors. Results suggest there is self-selection in test repetition. (Author/GK)

Descriptors: College Entrance Examinations, Comparative Analysis, Error of Measurement, Scores

Student Self-Selection and Test Repetition.

Download full text

Alderman, Donald L. – 1981

The test performance of students who took the Scholastic Aptitude Test (SAT) only once as juniors was contrasted with students who took the test as juniors and again as seniors. Estimates of expected test performance on a common initial administration in the junior year were derived from separate equating sections and background variables.…

Descriptors: Comparative Analysis, Error of Measurement, High School Students, High Schools

An Alternative Interpretation of Three Stability Models. Measurement and Methodology, Work Unit 2: Technical Adequacy of Tests.

Wilcox, Rand R. – 1978

Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…

Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models

Considerations for Creating Multi-Language Personality Norms: A Three-Component Model of Error

Peer reviewed

Direct link

Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008

With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…

Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Assessing the Psychometric Quality of Performance Rating Scales: Comparisons among Evaluative Criteria.

Download full text

Lance, Charles E.; Moomaw, Michael E. – 1983

Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…

Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria

Linear Discriminant Analysis versus Logistic Regression: A Comparison of Classification Errors in the Two-Group Case

Peer reviewed

Direct link

Lei, Pui-Wa; Koehly, Laura M. – Journal of Experimental Education, 2003

Classification studies are important for practitioners who need to identify individuals for specialized treatment or intervention. When interventions are irreversible or misclassifications are costly, information about the proficiency of different classification procedures becomes invaluable. This study furnishes information about the relative…

Descriptors: Monte Carlo Methods, Classification, Discriminant Analysis, Regression (Statistics)