Evans, Sion Wyn – Educational Studies in Mathematics, 2007
This paper draws on data from the development of annual national mathematics assessment materials for 7-year-old pupils in Wales for use during the period 2000-2002. The materials were developed in both English and Welsh and were designed to be matched. The paper reports on item analyses which sought items that exhibited differential performance…
Descriptors: Foreign Countries, Welsh, Test Bias, Educational Testing
Ferne, Tracy; Rupp, Andre A. – Language Assessment Quarterly, 2007
This article reviews research on differential item functioning (DIF) in language testing conducted primarily between 1990 and 2005 with an eye toward providing methodological guidelines for developing, conducting, and disseminating research in this area. The article contains a synthesis of 27 studies with respect to five essential sets of…
Descriptors: Test Bias, Evaluation Research, Testing, Language Tests
Westby, Carol – Communication Disorders Quarterly, 2007
This article reviews the concept of intelligence from different cultural perspectives and explains why the traditional approach to determining "who is smart" is inappropriate for students from culturally/linguistically diverse backgrounds and inadequate even for determining if mainstream students will be successful in daily living. The concept of…
Descriptors: Intelligence, Cultural Differences, Cultural Relevance, Student Diversity
Papanastasiou, Elena C.; Reckase, Mark D. – International Journal of Testing, 2007
Because of the increased popularity of computerized adaptive testing (CAT), many admissions tests, as well as certification and licensure examinations, have been transformed from their paper-and-pencil versions to computerized adaptive versions. A major difference between paper-and-pencil tests and CAT from an examinee's point of view is that in…
Descriptors: Simulation, Adaptive Testing, Computer Assisted Testing, Test Items
Meade, Adam W.; Lautenschlager, Gary J.; Johnson, Emily C. – Applied Psychological Measurement, 2007
This article highlights issues associated with the use of the differential functioning of items and tests (DFIT) methodology for assessing measurement invariance (or differential functioning) with Likert-type data. Monte Carlo analyses indicate relatively low sensitivity of the DFIT methodology for identifying differential item functioning (DIF)…
Descriptors: Measures (Individuals), Monte Carlo Methods, Likert Scales, Effect Size
Birenbaum, Menucha – Studies in Educational Evaluation, 2007
High quality assessment practice is expected to yield valid and useful score-based interpretations about what the examinees know and are able to do with respect to a defined target domain. Given this assertion, the article presents a framework based on the "unified view of validity," advanced by Cronbach and Messick over two decades ago, to assist…
Descriptors: Quality Control, Student Evaluation, Validity, Evaluation Methods
Braun, Henry; Zhang, Jinming; Vezzu, Sailesh – ETS Research Report Series, 2008
At present, although the percentages of students with disabilities (SDs) and/or students who are English language learners (ELL) excluded from a NAEP administration are reported, no statistical adjustment is made for these excluded students in the calculation of NAEP results. However, the exclusion rates for both SD and ELL students vary…
Descriptors: Research Methodology, Computation, Disabilities, English Language Learners
Sellbom, Martin; Bagby, R. Michael – Psychological Assessment, 2008
In the current investigation, the authors examined the validity of the L-r and K-r scales on the recently developed Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Y. S. Ben-Porath & A. Tellegen, in press) in measuring underreported response bias. Three archival samples previously collected for examining MMPI-2…
Descriptors: Schizophrenia, Response Style (Tests), Test Validity, Child Custody
Walker, Ruth; Barwell, Graham – International Journal for the Scholarship of Teaching and Learning, 2009
Peer bias is recognised as a primary factor in negative student perceptions of peer assessment strategies. This study trialled the use of classroom response systems, widely known as clickers, in small seminar classes in order to actively engage students in their subject's assessment process while providing the anonymity that would lessen the…
Descriptors: Educational Technology, Peer Evaluation, Audience Response Systems, Focus Groups
Zhang, Ying; Elder, Catherine – Language Assessment Quarterly, 2009
The College English Test-Spoken English Test is a nationwide spoken English test designed to assess the oral communicative ability of Chinese university and college students who have undertaken compulsory English study at a Chinese university. This article describes the test and evaluates it in terms of reliability, validity, authenticity,…
Descriptors: Test Results, Language Tests, Rating Scales, Foreign Countries
Dawson, George; And Others – Science Teacher, 1974 (peer reviewed)
Discusses possible sources of bias in reading tests used to evaluate science materials. (PEB)
Descriptors: Bias, Readability, Reading Difficulty, Reading Skills
Southwest Regional Resource Center, Salt Lake City, UT. – 1977
Intended for personnel in State Educational Agencies, the document provides guidelines, procedures, and forms for implementation of nondiscriminatory assessment practices with handicapped children and adults. In an introductory section, a model is presented for establishing the relationships between various components of an unbiased assessment…
Descriptors: Conceptual Schemes, Disabilities, Evaluation Methods, Guidelines
Greene, Roger L. – Journal of Consulting and Clinical Psychology, 1987 (peer reviewed)
Reviews Minnesota Multiphasic Personality Inventory (MMPI) performance as a function of ethnic group membership in Asian Americans, Blacks, Hispanics, and Native Americans. Recommends issues raised in the review be addressed by research before it can be concluded that new norms for the MMPI are needed for specific ethnic groups. (Author/NB)
Descriptors: Ethnic Groups, Ethnicity, Personality Assessment, Test Bias
Lippmann, Walter – Educational Forum, 1986 (peer reviewed)
The author answers Terman's allegations. He states that, while he honestly thinks that there is a considerable future for mental testing, it is also a field that could be dangerous if the people in positions of leadership are "loose-minded." (CT)
Descriptors: Intelligence Tests, Test Bias, Test Reliability, Test Validity
Sawyer, Richard L.; And Others – Journal of Educational Measurement, 1976 (peer reviewed)
This article examines some of the values that might be considered in a selection situation within the context of a decision theoretic model also described here. Several alternate expressions of fair selection are suggested in the form of utility statements in which these values can be understood and compared. (Author/DEP)
Descriptors: Bias, Decision Making, Evaluation Criteria, Models

