Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Source
Educational and Psychological… | 9 |
Author
Abedi, Jamal | 1 |
Baker, Eva L. | 1 |
Bannister, Brendan D. | 1 |
Cooke, Robert A. | 1 |
Feldt, Leonard S. | 1 |
Gorin, Joanna S. | 1 |
Halpin, Gerald | 1 |
Johnson, Brian W. | 1 |
Kim, Seonghoon | 1 |
Kinicki, Angelo J. | 1 |
Marshall, Seth J. | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 8 |
Reports - Evaluative | 2 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Coopersmith Self Esteem… | 1 |
What Works Clearinghouse Rating
Marshall, Seth J.; Wodrich, David L.; Gorin, Joanna S. – Educational and Psychological Measurement, 2009
This study examined psychometric properties of the Tempe Sorting Task (TST), a new measure of executive function (EF) for children. To increase the meaningfulness of test score interpretations, an age-appropriate construct was employed to incorporate Denckla's description of EF. Multiple measures of EF, including the TST, were collected for…
Descriptors: Cognitive Tests, Cognitive Processes, Children, Attention Deficit Hyperactivity Disorder

Powers, Stephen; And Others – Educational and Psychological Measurement, 1985
Results of an administration of the Language Proficiency Measure indicated that the interrater reliability was adequate, internal-consistency reliability estimates were high, concurrent validity coefficients were adequate, and the classification validity was acceptable. (Author/LMO)
Descriptors: Elementary Education, Interrater Reliability, Language Proficiency, Language Tests

Johnson, Brian W. – Educational and Psychological Measurement, 1983
Regression analyses indicated that the Coopersmith Self-Esteem Inventory has convergent validity with regard to the Piers-Harris Children's Self-Concept Scale and the Coopersmith Behavioral Academic Assessment Scale, has discriminant validity with regard to the Children's Social Desirability Scale, is sensitive to differences in achievement level,…
Descriptors: Academic Achievement, Intermediate Grades, Interrater Reliability, Self Concept Measures
Feldt, Leonard S.; Kim, Seonghoon – Educational and Psychological Measurement, 2006
Researchers sometimes need a statistical test of the hypothesis that two values of Cronbach's alpha reliability coefficient are equal. The situation may involve scores from two different measures administered to independent random samples or from the same measure administered to random samples from two different populations. Feldt derived a test…
Descriptors: Individual Testing, Test Items, Sample Size, Scores

Bannister, Brendan D.; And Others – Educational and Psychological Measurement, 1987
To control for response bias in student ratings of college teachers, an index of rater error was used that was theoretically independent of actual performance. Partialing out the effects of this extraneous response bias enhanced validity, but partialing out overall effectiveness resulted in reduced convergent and discriminant validities.…
Descriptors: Error of Measurement, Higher Education, Interrater Reliability, Response Style (Tests)

Cooke, Robert A.; And Others – Educational and Psychological Measurement, 1987
Lafferty's Life Styles Inventory was completed by 556 managers (Level I, Self-Description) and by 2,922 peers, subordinates, and supervisors (Level II, Description by Others). Factor analysis revealed the same three factors in both ratings. Coworkers generally agreed with each others' ratings, but correlations between self and coworker ratings…
Descriptors: Administrator Evaluation, Adults, Behavior Rating Scales, Cognitive Style

Halpin, Gerald; And Others – Educational and Psychological Measurement, 1983
Although arbitrary, whenever multiple judgmental standard-setting procedures are utilized by different groups concurrently, stability across raters can be achieved and decisions can be made in a relatively judicious manner. Greater stability across methods (Ebel, Nedelsky, Angoff) may be effected by slightly modifying the Ebel approach. (Author/PN)
Descriptors: Admission Criteria, College Entrance Examinations, Cutting Scores, Higher Education

Kinicki, Angelo J.; And Others – Educational and Psychological Measurement, 1985
Using both the Behaviorally Anchored Rating Scales (BARS) and the Purdue University Scales, 727 undergraduates rated 32 instructors. The BARS had less halo effect, more leniency error, and lower interrater reliability. Both formats were valid. The two tests did not differ in rate discrimination or susceptibility to rating bias. (Author/GDC)
Descriptors: Behavior Rating Scales, College Faculty, Comparative Testing, Higher Education

Abedi, Jamal; Baker, Eva L. – Educational and Psychological Measurement, 1995
Results from a performance assessment in which 68 high school students wrote essays support the use of latent variable modeling for estimating reliability, concurrent validity, and generalizability of a scoring rubric. The latent variable modeling approach overcomes the limitations of certain conventional statistical techniques in handling…
Descriptors: Criteria, Essays, Estimation (Mathematics), Generalizability Theory