Davis, Derrick D. – Alabama Journal of Educational Leadership, 2021
Without question, faculty (regardless of discipline) should be equipped with the necessary skills to assess students fairly and ethically. This study focuses on the central and prevailing importance of faculty judgment and how that judgment (or lack thereof) influences perceptions related to ethics and assessment of students. The study outlines…
Descriptors: Student Evaluation, Evaluative Thinking, Elementary School Teachers, Secondary School Teachers
Thurlow, Martha L.; Warren, Sandra H.; Chia, Magda – National Center on Educational Outcomes, 2020
This report provides 10 lessons about how to ensure inclusive assessment practices for students with disabilities and English learners. In addition to the 10 lessons, it provides foundational information on the characteristics of these students that require consideration during all phases of assessment design, development, and implementation. The…
Descriptors: Students with Disabilities, English Language Learners, Inclusion, Student Evaluation
Fan, Xumei; Johnson, Robert; Liu, Xiumei – New Waves-Educational Research and Development Journal, 2017
The purpose of this study was to investigate Chinese university professors' perceptions about the ethicality of classroom assessment practices. In a survey of Chinese professors, participants completed a questionnaire with 15 scenarios that depicted ethical and unethical assessment practices. Participants consisted of 555 professors from 143…
Descriptors: Foreign Countries, College Faculty, Ethics, Student Evaluation
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Woods, Carol M.; Grimm, Kevin J. – Applied Psychological Measurement, 2011
In extant literature, multiple indicator multiple cause (MIMIC) models have been presented for identifying items that display uniform differential item functioning (DIF) only, not nonuniform DIF. This article addresses, for apparently the first time, the use of MIMIC models for testing both uniform and nonuniform DIF with categorical indicators. A…
Descriptors: Test Bias, Testing, Interaction, Item Response Theory
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Reddell, Samantha – Online Submission, 2010
The purpose of this research paper was to examine the effects of standardized testing on the youth of America. It was intended to point out the shortcomings of the usage of such tests. There were comparisons of the effects testing has on different cultures of students as well as different socioeconomic classes. Court cases were brought into play…
Descriptors: Evaluation Methods, Student Evaluation, Court Litigation, Test Bias
Woods, Carol M. – Applied Psychological Measurement, 2011
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
Descriptors: Simulation, Item Response Theory, Testing, Questionnaires
Woods, Carol M. – Applied Psychological Measurement, 2009
Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…
Descriptors: Test Results, Testing, Item Response Theory, Test Bias
Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S. – Educational Assessment, 2008
In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…
Descriptors: Test Use, Educational Assessment, Psychological Testing, Validity

Hinkle, J. Scott – Measurement and Evaluation in Counseling and Development, 1994
Presents a cross-cultural perspective for testing practitioners. Discusses practical cross-cultural assessment issues, including test unfairness or bias. Offers solutions to testing issues regarding diverse populations. Includes 80 citations. (Author/CRR)
Descriptors: Counselor Training, Cultural Differences, Cultural Pluralism, Evaluation

Bennett, Randy Elliot – Exceptional Children, 1983
The article summarizes current knowledge as it relates to three basic requirements for assessment (qualified personnel, adequate tools, and fair implementation) and identifies research and evaluation priorities for special education and school psychology. Priorities include defining minimum competency for assessment personnel and identifying…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Methods, School Psychologists
Lam, Tony C. M. – 1995
Performance assessment is a type of educational assessment in which judgments are made about student knowledge and skills based on observation of student behavior or inspection of student products. In dealing with the issue of fairness in performance assessment, educators are confronted with some dilemmas. Assuring equality in performance…
Descriptors: Evaluation Methods, Evaluation Problems, Evaluation Research, Performance Factors
Fineman, Carol A.; Ross, Amparo – 1980
The project titled "Evaluating the non-English Speaking Handicapped" was established to research existing evaluation instruments in languages other than English, validate the tests as well as additional translations where needed, and develop a procedural manual for distribution and use in evaluating non-English speaking handicapped students. The…
Descriptors: Bilingual Education, Disabilities, Elementary Secondary Education, Evaluation Methods