Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 7 |
Descriptor
| Evaluation Methods | 48 |
| Test Bias | 48 |
| Testing Problems | 48 |
| Student Evaluation | 19 |
| Elementary Secondary Education | 18 |
| Test Validity | 13 |
| Standardized Tests | 11 |
| Test Construction | 10 |
| Test Items | 10 |
| Educational Testing | 9 |
| Minority Groups | 9 |
| More ▼ | |
Source
Author
| Childs, Ruth A. | 2 |
| White, Edward M. | 2 |
| Ysseldyke, James E. | 2 |
| Angela Johnson | 1 |
| Archer, Mary | 1 |
| Ascher, Carol | 1 |
| Boscardin, Mary Lynn | 1 |
| Camilli, Gregory | 1 |
| Cancelli, Anthony A. | 1 |
| Chandler, Harry N. | 1 |
| Cronje, Johannes C. | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
| Practitioners | 5 |
| Researchers | 2 |
Location
| South Africa | 2 |
| Canada | 1 |
| Minnesota | 1 |
Laws, Policies, & Programs
| Education for All Handicapped… | 3 |
| Social Security | 1 |
Assessments and Surveys
| Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
Penfield, Randall D.; Gattamorta, Karina; Childs, Ruth A. – Educational Measurement: Issues and Practice, 2009
Traditional methods for examining differential item functioning (DIF) in polytomously scored test items yield a single item-level index of DIF and thus provide no information concerning which score levels are implicated in the DIF effect. To address this limitation of DIF methodology, the framework of differential step functioning (DSF) has…
Descriptors: Test Bias, Test Items, Evaluation Methods, Scores
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size
Ysseldyke, James E. – 1977
The author traces reasons to support his contention that the state of the art in assessing learning disabled students is not good. Among issues examined are the following: use of tests for purposes other than those for which they were intended; technical adequacy of currently used tests (standardization, reliability, validity); the use of deficit…
Descriptors: Evaluation Methods, Learning Disabilities, Student Evaluation, Test Bias
Kahn, Ann P. – Today's Education, 1979
Questions are raised regarding the validity of norm-referenced tests and mass testing as accurate methods for evaluating student educational needs. (LH)
Descriptors: Educational Needs, Evaluation Methods, Needs Assessment, Norm Referenced Tests
Hills, John R. – 1984
The literature on item bias, i.e., the question of whether some items in tests favor one cultural group over another cultural group due to irrelevant factors, is reviewed and evaluated. All known references through 1981 are described including a large number of unpublished reports. Each method is described and the criticisms that have appeared in…
Descriptors: Evaluation Methods, Item Analysis, Racial Differences, Test Bias
Peer reviewedChandler, Harry N. – Journal of Learning Disabilities, 1984
The author discusses the complex nature of assessing bilingual special education students, recommends a textbook on the topic, notes the contributions of J. Mercer's System of Multicultural Pluralistic Assessment, and cites the need for special educators to focus on the issue. (CL)
Descriptors: Disability Identification, Elementary Secondary Education, Evaluation Methods, Limited English Speaking
Peer reviewedJohanson, George A.; And Others – Evaluation Review, 1993
The tendency of some respondents to omit items more often when they feel they have a less positive evaluation to make and less frequently when the evaluation is more positive is discussed. Five examples illustrate this form of nonresponse bias. Recommendations to overcome nonresponse bias are offered. (SLD)
Descriptors: Estimation (Mathematics), Evaluation Methods, Questionnaires, Response Style (Tests)
MacMillan, Donald I.; Meyers, C. Edward – Viewpoints, 1977
The author discusses the difficulties inherent in testing procedures when used for classifying children as handicapped, and procedures which could be used to insure that such tests are applied, evaluated, and interpreted in a nondiscriminatory manner. (MB)
Descriptors: Civil Rights, Classification, Disability Discrimination, Evaluation Methods
Peer reviewedLevin, Henry M. – Journal of Vocational Behavior, 1988
Considers racial impact of using ability tests for employment decisions. Notes agreement that large differences exist in performance on general ability among races and that there is probable relation between general ability and job performance for most jobs; disagreement about whether racial differences in general ability are fixed and about size…
Descriptors: Aptitude Tests, Employment Practices, Evaluation Criteria, Evaluation Methods
Peer reviewedStinespring, John A. – Roeper Review, 1991
A culture-specific approach to identifying artistic talent in African-American students is presented, called Tactuality. Tactuality identifies four characteristics: emotional intensity, flexibility and open-endedness, holistic perception, and tactile sensitivity. A pilot test with 63 fourth graders resulted in statistical problems with validation.…
Descriptors: Black Students, Elementary Secondary Education, Evaluation Methods, Talent
Peer reviewedKrushat, W. Mark; Molnar, John I. – Evaluation Practice, 1993
After a mailed questionnaire and two follow-ups, hard-core survey nonrespondents were contacted to ask why they had not responded and what they would have answered had they responded. Results from 28 subjects indicate that efforts to pursue nonrespondents may be unnecessary, because bias resulting from nonresponse appeared minimal. (SLD)
Descriptors: Comparative Analysis, Data Collection, Evaluation Methods, Followup Studies

Direct link
