Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 6 |
Descriptor
Predictive Validity | 13 |
Test Items | 13 |
Student Evaluation | 4 |
Test Construction | 4 |
Test Reliability | 4 |
Construct Validity | 3 |
Foreign Countries | 3 |
Item Analysis | 3 |
Test Bias | 3 |
Test Theory | 3 |
Test Validity | 3 |
More ▼ |
Source
Author
Benton, Tom | 1 |
Budescu, David V. | 1 |
Diamond, Esther E. | 1 |
Distefano, M. K., Jr. | 1 |
Gonzalez-Tamayo, Eulogio | 1 |
Greenlees, Iain | 1 |
Griffin, Noelle | 1 |
Hobson, Gina | 1 |
Holder, Tim | 1 |
Kerr, Stephen | 1 |
Krull, George | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 13 |
Journal Articles | 8 |
Education Level
Higher Education | 3 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 2 | 1 |
Audience
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
SAT (College Admission Test) | 2 |
Armed Services Vocational… | 1 |
International English… | 1 |
What Works Clearinghouse Rating
Benton, Tom – Research Matters, 2020
This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…
Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis
Quaid, Ethan Douglas – Language Testing in Asia, 2018
This paper reviews the International English Language Testing System's speaking sub-test in the East Asia region with reference to theoretical and practice-based perspectives and identifies future research opportunities to enhance the measures of test qualities found. The test's construct validity was seen to accurately measure the abilities…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Speech Tests
Kerr, Stephen; Krull, George – Journal of Learning in Higher Education, 2017
This paper explored the authors' concerns about students enrolled in their introductory accounting course. Anecdotal evidence suggested that students struggle with basic arithmetic concepts that underlie basic business transactions even though their math placement and ACT scores are high. A survey of 125 students in a first accounting course was…
Descriptors: Accounting, Business Administration Education, Skill Development, Arithmetic
Soares, Joseph A. – Research & Practice in Assessment, 2012
In Philip Pullman's dark matter sci-fi trilogy, there is a golden compass that in the hands of the right person is predictively powerful; the same was supposed to be true of the SAT/ACT--the statistically indistinguishable standardized tests for college admissions. They were intended to be reliable mechanisms for identifying future trajectories,…
Descriptors: Aptitude Tests, College Entrance Examinations, Educational Benefits, Barriers

Kuder, Frederic; Diamond, Esther E.; Zytowski, Donald G. – Educational and Psychological Measurement, 1998
Predictive validity, generally taken to be the prime validity that occupationally normed interest inventories should demonstrate, is dependent on the capacity of an instrument to differentiate between occupations. A comparison of two methods of differentiation shows that a method using proportions of each occupational group to assign item-scoring…
Descriptors: Interest Inventories, Occupational Tests, Predictive Measurement, Predictive Validity
Budescu, David V. – 1979
This paper outlines a technique for differentially weighting options of a multiple choice test in a fashion that maximizes the item predictive validity. The rule can be applied with different number of categories and the "optimal" number of categories can be determined by significance tests and/or through the R2 criterion. Our theoretical analysis…
Descriptors: Multiple Choice Tests, Predictive Validity, Scoring Formulas, Test Items
Niemi, David; Vallone, Julia; Wang, Jia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
Many districts and schools across the U. S. have begun to develop and administer assessments to complement state testing systems and provide additional information to monitor curriculum, instruction and schools. In advance of this trend, the Jackson Public Schools (JPS) district has had a district benchmark testing system in place for many years.…
Descriptors: Public Schools, Testing Programs, Educational Testing, Item Analysis

Distefano, M. K., Jr.; Pryer, Margaret W. – Educational and Psychological Measurement, 1987
From 13 objective interview items, five with adequate response variability were studied to determine if they would improve the validity of a verbal ability selection test in predicting work performance of 181 psychiatric aide trainees. In a multiple regression analysis, the verbal test correlated .27 with the weighted composite rating score.…
Descriptors: Multiple Regression Analysis, Objective Tests, Predictive Validity, Psychiatric Aides
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Gonzalez-Tamayo, Eulogio – 1987
The agreement between the Educational Testing Service (ETS) and the Golden Rule Insurance Company of Illinois is interpreted as setting the general principles on which items must be selected to be included in a licensure test. These principles put a limit to the difficulty level of any item, and they also limit the size of the difference in…
Descriptors: Analysis of Variance, Content Validity, Difficulty Level, Item Analysis
Rosser, Phyllis – 1989
Questions on the Scholastic Aptitude Test (SAT) with the largest score differences between women and men of all racial and ethnic groups were identified. Patterns of difficulty that would explain the SAT's continuing underprediction of female first-year college performance were studied. An item analysis of one form of the June 1986 SAT for 1,112…
Descriptors: Ethnic Groups, Females, High School Seniors, High Schools
Greenlees, Iain; Lane, Andrew; Thelwell, Richard; Holder, Tim; Hobson, Gina – Research Quarterly for Exercise and Sport, 2005
The aim of this study was to develop and validate a team-referent attribution scale. Conducted over three studies, Study 1 modified items from McAuley, Duncan, and Russell's (1992) Causal Dimension Scale II by rewording items to reflect team attributions and adding one item per factor. This led to the development of a 16-item scale (Causal…
Descriptors: Attribution Theory, Team Sports, Athletes, Factor Analysis
Weltin, Mary M.; Popelka, Beverly A. – 1983
The composite of Armed Services Vocational Aptitude Battery (ASVAB) subtests used to select applicants for entry-level training in Army clerical schools was evaluated by correlating composite scores with training performance scores. Comparisons were made between the multiple R for this optimal set of predictors and that for the composite of…
Descriptors: Achievement, Aptitude Tests, Armed Forces, Clerical Occupations