ERIC - Search Results

Showing all 13 results

How Useful Is Comparative Judgement of Item Difficulty for Standard Maintaining?

Benton, Tom – Research Matters, 2020

This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…

Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis

Reviewing the IELTS Speaking Test in East Asia: Theoretical and Practice-Based Insights

Peer reviewed

Quaid, Ethan Douglas – Language Testing in Asia, 2018

This paper reviews the International English Language Testing System's speaking sub-test in the East Asia region with reference to theoretical and practice-based perspectives and identifies future research opportunities to enhance the measures of test qualities found. The test's construct validity was seen to accurately measure the abilities…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Speech Tests

The Risks and Opportunities Associated with Weak Arithmetic Skills of Accounting Students

Peer reviewed

Kerr, Stephen; Krull, George – Journal of Learning in Higher Education, 2017

This paper explored the authors' concerns about students enrolled in their introductory accounting course. Anecdotal evidence suggested that students struggle with basic arithmetic concepts that underlie basic business transactions even though their math placement and ACT scores are high. A survey of 125 students in a first accounting course was…

Descriptors: Accounting, Business Administration Education, Skill Development, Arithmetic

For Tests That Are Predictively Powerful and without Social Prejudice

Peer reviewed

Soares, Joseph A. – Research & Practice in Assessment, 2012

In Philip Pullman's dark matter sci-fi trilogy, there is a golden compass that in the hands of the right person is predictively powerful; the same was supposed to be true of the SAT/ACT--the statistically indistinguishable standardized tests for college admissions. They were intended to be reliable mechanisms for identifying future trajectories,…

Descriptors: Aptitude Tests, College Entrance Examinations, Educational Benefits, Barriers

Differentiation as Fundamental Validity for Criterion-Group Scaled Interest Inventories.

Peer reviewed

Kuder, Frederic; Diamond, Esther E.; Zytowski, Donald G. – Educational and Psychological Measurement, 1998

Predictive validity, generally taken to be the prime validity that occupationally normed interest inventories should demonstrate, is dependent on the capacity of an instrument to differentiate between occupations. A comparison of two methods of differentiation shows that a method using proportions of each occupational group to assign item-scoring…

Descriptors: Interest Inventories, Occupational Tests, Predictive Measurement, Predictive Validity

Differential Weighting of Multiple-Choice Items.

Budescu, David V. – 1979

This paper outlines a technique for differentially weighting the options of a multiple-choice test in a fashion that maximizes the item's predictive validity. The rule can be applied with different numbers of categories, and the "optimal" number of categories can be determined by significance tests and/or through the R² criterion. Our theoretical analysis…

Descriptors: Multiple Choice Tests, Predictive Validity, Scoring Formulas, Test Items

Recommendations for Building a Valid Benchmark Assessment System: Interim Report to the Jackson Public Schools. CRESST Report 723

Niemi, David; Vallone, Julia; Wang, Jia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007

Many districts and schools across the U.S. have begun to develop and administer assessments to complement state testing systems and provide additional information to monitor curriculum, instruction and schools. In advance of this trend, the Jackson Public Schools (JPS) district has had a district benchmark testing system in place for many years.…

Descriptors: Public Schools, Testing Programs, Educational Testing, Item Analysis

Evaluation of Selected Interview Data in Improving the Predictive Validity of a Verbal Ability Test with Psychiatric Aide Trainees.

Peer reviewed

Distefano, M. K., Jr.; Pryer, Margaret W. – Educational and Psychological Measurement, 1987

From 13 objective interview items, five with adequate response variability were studied to determine if they would improve the validity of a verbal ability selection test in predicting work performance of 181 psychiatric aide trainees. In a multiple regression analysis, the verbal test correlated .27 with the weighted composite rating score.…

Descriptors: Multiple Regression Analysis, Objective Tests, Predictive Validity, Psychiatric Aides

What Counts as Evidence of Educational Achievement? The Role of Constructs in the Pursuit of Equity in Assessment

Peer reviewed

Wiliam, Dylan – Review of Research in Education, 2010

The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…

Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

The Golden Rule Agreement is Psychometrically Defensible.

Gonzalez-Tamayo, Eulogio – 1987

The agreement between the Educational Testing Service (ETS) and the Golden Rule Insurance Company of Illinois is interpreted as setting the general principles on which items must be selected to be included in a licensure test. These principles put a limit to the difficulty level of any item, and they also limit the size of the difference in…

Descriptors: Analysis of Variance, Content Validity, Difficulty Level, Item Analysis

The SAT Gender Gap: Identifying the Causes.

Rosser, Phyllis – 1989

Questions on the Scholastic Aptitude Test (SAT) with the largest score differences between women and men of all racial and ethnic groups were identified. Patterns of difficulty that would explain the SAT's continuing underprediction of female first-year college performance were studied. An item analysis of one form of the June 1986 SAT for 1,112…

Descriptors: Ethnic Groups, Females, High School Seniors, High Schools

Team-Referent Attributions among Sport Performers

Peer reviewed

Greenlees, Iain; Lane, Andrew; Thelwell, Richard; Holder, Tim; Hobson, Gina – Research Quarterly for Exercise and Sport, 2005

The aim of this study was to develop and validate a team-referent attribution scale. Conducted over three studies, Study 1 modified items from McAuley, Duncan, and Russell's (1992) Causal Dimension Scale II by rewording items to reflect team attributions and adding one item per factor. This led to the development of a 16-item scale (Causal…

Descriptors: Attribution Theory, Team Sports, Athletes, Factor Analysis

Evaluation of the ASVAB 8/9/10 Clerical Composite for Predicting Training School Performance. Technical Report 594.

Weltin, Mary M.; Popelka, Beverly A. – 1983

The composite of Armed Services Vocational Aptitude Battery (ASVAB) subtests used to select applicants for entry-level training in Army clerical schools was evaluated by correlating composite scores with training performance scores. Comparisons were made between the multiple R for this optimal set of predictors and that for the composite of…

Descriptors: Achievement, Aptitude Tests, Armed Forces, Clerical Occupations
