Showing 1 to 15 of 180 results
Peer reviewed
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Peer reviewed
Zhao, Cecilia Guanfang; Liu, Carina Jiayu – Language Testing, 2019
Celpe-Bras is the exam for the certification of proficiency in Portuguese as a foreign language. It is the only Portuguese proficiency test recognized by the Brazilian government (Ministério da Educação, 2013). Given the recent growth of interest and also its unique design as a large-scale proficiency test, this article provides a general…
Descriptors: Portuguese, Second Language Learning, Language Proficiency, Language Tests
Peer reviewed
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Peer reviewed
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Dawis, Rene V. – New Directions for Testing and Measurement, 1980
New as well as landmark instruments and research are described. Among the contemporary issues dealt with is a concern for the source of and methods useful in controlling bias in the construction of interest inventories. With the assessment of interests, as with all measurement, validity is the bottom line. (Author)
Descriptors: Interest Inventories, Interest Research, Scaling, Test Bias
Peer reviewed
Echternacht, Gary – Educational and Psychological Measurement, 1974
Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias
Reynolds, Cecil R. – 1981
The cultural test bias hypothesis represents the contention that all ethnic or racial group differences on mental tests are due to inherent, artifactual biases produced within the tests through flawed psychometric methodology. This address focuses on an empirical evaluation of the cultural test bias hypothesis, especially emphasizing the construct…
Descriptors: Elementary Secondary Education, Intelligence Tests, Personality Measures, Test Bias
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
Ysseldyke, James E. – 1977
The author traces reasons to support his contention that the state of the art in assessing learning disabled students is not good. Among issues examined are the following: use of tests for purposes other than those for which they were intended; technical adequacy of currently used tests (standardization, reliability, validity); the use of deficit…
Descriptors: Evaluation Methods, Learning Disabilities, Student Evaluation, Test Bias
Synk, David J. – 1983
This study used meta-analysis research techniques to determine if there are differences in General Aptitude Test Battery (GATB) validities and test scores between males and females. The sample consisted of 26,111 subjects from 122 Specific Aptitude Test Battery (SATB) validation or revalidation studies analyzed since 1972. Four approaches were…
Descriptors: Aptitude Tests, Measurement Techniques, Meta Analysis, Scores
Ekstrom, Ruth B. – 1979
Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…
Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction
Peer reviewed
Howard, George S.; And Others – Journal of Educational Measurement, 1979
Evaluations of experimental interventions which employ self-report measures are subject to contamination known as response-shift bias. Response-shift effects may be attenuated by substituting retrospective pretest ratings for the traditional self-report pretest ratings. This study indicated that the retrospective rating more accurately reflected…
Descriptors: Higher Education, Rating Scales, Response Style (Tests), Self Evaluation
Kahn, Ann P. – Today's Education, 1979
Questions are raised regarding the validity of norm-referenced tests and mass testing as accurate methods for evaluating student educational needs. (LH)
Descriptors: Educational Needs, Evaluation Methods, Needs Assessment, Norm Referenced Tests
Peer reviewed
Seymour, Richard T. – Journal of Vocational Behavior, 1988
Argues that occupational tests can exclude racial minorities and that many industrial psychologists have overlooked evidence that many tests are biased and that some claims for validity generalization are based on faulty science. Outlines what plaintiff's counsel looks for in deciding to try a testing case, and provides primer on how to challenge…
Descriptors: Court Litigation, Employment Practices, Generalization, Minority Groups