Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 6 |
Descriptor
Test Bias | 180 |
Test Validity | 180 |
Testing Problems | 180 |
Test Reliability | 55 |
Standardized Tests | 53 |
Elementary Secondary Education | 51 |
Test Interpretation | 49 |
Achievement Tests | 42 |
Test Construction | 40 |
Educational Testing | 36 |
Intelligence Tests | 29 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Location
California | 5 |
Illinois | 3 |
Arizona | 2 |
Canada | 2 |
Florida | 2 |
Brazil | 1 |
Chile | 1 |
China | 1 |
New Jersey | 1 |
Pennsylvania | 1 |
Texas | 1 |
More ▼ |
Laws, Policies, & Programs
Bakke v Regents of University… | 2 |
Civil Rights Act 1964 Title… | 2 |
Education for All Handicapped… | 2 |
Larry P v Riles | 2 |
Elementary and Secondary… | 1 |
Rehabilitation Act 1973… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Zhao, Cecilia Guanfang; Liu, Carina Jiayu – Language Testing, 2019
Celpe-Bras, is the exam for the certification of proficiency in Portuguese as a foreign language. It, is the only Portuguese proficiency test recognized by the Brazilian government (Ministério da Educação, 2013). Given the recent growth of interest and also its unique design as a large-scale proficiency test, this article provides a general…
Descriptors: Portuguese, Second Language Learning, Language Proficiency, Language Tests
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Dawis, Rene V. – New Directions for Testing and Measurement, 1980
New as well as landmark instruments and research are described. Among the contemporary issues dealt with is a concern for the source of and methods useful in controlling bias in the construction of interest inventories. With the assessment of interests, as with all measurement, validity is the bottom line. (Author)
Descriptors: Interest Inventories, Interest Research, Scaling, Test Bias

Echternacht, Gary – Educational and Psychological Measurement, 1974
Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias
Reynolds, Cecil R. – 1981
The cultural test bias hypothesis represents the contention that all ethnic or racial group differences on mental tests are due to inherent, artifactual biases produced within the tests through flawed psychometric methodology. This address focuses on an empirical evaluation of the cultural test bias hypothesis, especially emphasizing the construct…
Descriptors: Elementary Secondary Education, Intelligence Tests, Personality Measures, Test Bias
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
Ysseldyke, James E. – 1977
The author traces reasons to support his contention that the state of the art in assessing learning disabled students is not good. Among issues examined are the following: use of tests for purposes other than those for which they were intended; technical adequacy of currently used tests (standardization, reliability, validity); the use of deficit…
Descriptors: Evaluation Methods, Learning Disabilities, Student Evaluation, Test Bias
Synk, David J. – 1983
This study used meta-analysis research techniques to determine if there are differences in General Aptitude Test Battery (GATB) validities and test scores between males and females. The sample consisted of 26,111 subjects from 122 Specific Aptitude Test Battery (SATB) validation or revalidation studies analyzed since 1972. Four approaches were…
Descriptors: Aptitude Tests, Measurement Techniques, Meta Analysis, Scores
Ekstrom, Ruth B. – 1979
Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…
Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction

Howard, George S.; And Others – Journal of Educational Measurement, 1979
Evaluations of experimental interventions which employ self-report measures are subject to contamination known as response-shift bias. Response-shift effects may be attenuated by substituting retrospective pretest ratings for the traditional self-report pretest ratings. This study indicated that the retrospective rating more accurately reflected…
Descriptors: Higher Education, Rating Scales, Response Style (Tests), Self Evaluation
Kahn, Ann P. – Today's Education, 1979
Questions are raised regarding the validity of norm-referenced tests and mass testing as accurate methods for evaluating student educational needs. (LH)
Descriptors: Educational Needs, Evaluation Methods, Needs Assessment, Norm Referenced Tests

Seymour, Richard T. – Journal of Vocational Behavior, 1988
Argues that occupational tests can exclude racial minorities and that many industrial psychologists have overlooked evidence that many tests are biased and that some claims for validity generalization are based on faulty science. Outlines what plaintiff's counsel looks for in deciding to try a testing case, and provides primer on how to challenge…
Descriptors: Court Litigation, Employment Practices, Generalization, Minority Groups