Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 25 |
Descriptor
Evaluation Methods | 51 |
Test Bias | 51 |
Test Construction | 51 |
Student Evaluation | 23 |
Test Validity | 23 |
Test Items | 18 |
Test Reliability | 17 |
Testing Problems | 10 |
Elementary Secondary Education | 8 |
Item Analysis | 8 |
Standardized Tests | 8 |
Author
Hambleton, Ronald K. | 3 |
Rogers, H. Jane | 3 |
Liu, Kristin K. | 2 |
Osmundson, Ellen | 2 |
Thurlow, Martha L. | 2 |
Albano, Anthony D. | 1 |
Amit Sevak | 1 |
Angela Johnson | 1 |
Arjoon, Janelle A. | 1 |
Astrid de Leeuw | 1 |
Bell, Gregory | 1 |
Education Level
Elementary Secondary Education | 7 |
Higher Education | 7 |
Postsecondary Education | 6 |
Elementary Education | 4 |
Intermediate Grades | 3 |
Secondary Education | 3 |
Middle Schools | 2 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Two Year Colleges | 1 |
Location
Canada | 2 |
Alabama | 1 |
California | 1 |
Slovakia | 1 |
South Africa | 1 |
United States | 1 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 3 |
Individuals with Disabilities… | 3 |
Rehabilitation Act 1973… | 3 |
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
Graduate Record Examinations | 1 |
National Assessment of… | 1 |
Program for International… | 1 |
SAT (College Admission Test) | 1 |
Woodcock Johnson Tests of… | 1 |
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Rainey, Katherine D.; Vignal, Michael; Wilcox, Bethany R. – Physical Review Physics Education Research, 2022
Currently there are no assessment instruments available for upper-division thermal physics, though several introductory assessments are currently available. Notably missing from these introductory assessments are items targeting statistical mechanics. This leaves a gap in the content that can be assessed by upper-division thermal physics faculty.…
Descriptors: Physics, Science Instruction, Thermodynamics, College Science
Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Davis, Derrick D. – Alabama Journal of Educational Leadership, 2021
Without question, faculty (regardless of discipline) should be equipped with the necessary skills to assess students fairly and ethically. This study focuses on the central and prevailing importance of faculty judgment and how that judgment (or lack thereof) influences perceptions related to ethics and assessment of students. The study outlines…
Descriptors: Student Evaluation, Evaluative Thinking, Elementary School Teachers, Secondary School Teachers
Kaplan, David; Su, Dan – Large-scale Assessments in Education, 2018
Background: This paper extends a recent study by Kaplan and Su ("J Educ Behav Stat" 41: 51-80, 2016) examining the problem of matrix sampling of context questionnaire scales with respect to the generation of plausible values of cognitive outcomes in large-scale assessments. Methods: Following Weirich et al. ("Nested multiple…
Descriptors: Questionnaires, Measurement, Measurement Techniques, Evaluation Methods
Smarter Balanced Assessment Consortium, 2020
The Smarter Balanced Assessment Consortium (Smarter Balanced) strives to provide every student with a positive and productive assessment experience, generating results that are a fair and accurate estimate of each student's achievement. Further, Smarter Balanced is building on a framework of accessibility for all students, including English…
Descriptors: Student Evaluation, Evaluation Methods, English Language Learners, Students with Disabilities
Updated Assessment Principles and Guidelines for English Learners with Disabilities. NCEO Report 424
Liu, Kristin K.; Lazarus, Sheryl S.; Thurlow, Martha L.; Jarmin, Jaime; Ward, Jenna; Christensen, Laurene – National Center on Educational Outcomes, 2020
This report is an update of the assessment principles and guidelines for English language learners published in 2013 (Thurlow, Liu, Ward, & Christensen). That report, which was developed by the Improving the Validity of Assessment Results for English Language Learners with Disabilities (IVARED) project, presented essential principles of…
Descriptors: English Language Learners, Students with Disabilities, Student Evaluation, Evaluation Methods
Smarter Balanced Assessment Consortium, 2019
The Smarter Balanced Assessment Consortium (Smarter Balanced) strives to provide every student with a positive and productive assessment experience, generating results that are a fair and accurate estimate of each student's achievement. Further, Smarter Balanced is building on a framework of accessibility for all students, including English…
Descriptors: Student Evaluation, Evaluation Methods, English Language Learners, Students with Disabilities
Smarter Balanced Assessment Consortium, 2018
The Smarter Balanced Assessment Consortium (Smarter Balanced) strives to provide every student with a positive and productive assessment experience, generating results that are a fair and accurate estimate of each student's achievement. Further, Smarter Balanced is building on a framework of accessibility for all students, including English…
Descriptors: Student Evaluation, Evaluation Methods, English Language Learners, Disabilities
Lyse Langlois; Claire Lapointe; Pierre Valois; Astrid de Leeuw – Journal of Educational Administration, 2014
Purpose: This study had five objectives: explain the initial steps that led to the construction of the Ethical Leadership Questionnaire (ELQ); analyze the items and verify the ELQ reliability using item response theory (IRT); examine its factorial structure with a confirmatory factor analysis (CFA) and an exploratory structural equation modeling…
Descriptors: Test Construction, Test Reliability, Test Validity, Questionnaires
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
Lin, Pei-Ying; Lin, Yu-Cheng – Educational and Psychological Measurement, 2014
This exploratory study investigated potential sources of setting accommodation resulting in differential item functioning (DIF) on math and reading assessments for examinees with varied learning characteristics. The examinees were those who participated in large-scale assessments and were tested in either standardized or accommodated testing…
Descriptors: Test Bias, Multivariate Analysis, Testing Accommodations, Mathematics Tests
Arjoon, Janelle A.; Xu, Xiaoying; Lewis, Jennifer E. – Journal of Chemical Education, 2013
Many of the instruments developed for research use by the chemistry education community are relatively new. Because psychometric evidence dictates the validity of interpretations made from test scores, gathering and reporting validity and reliability evidence is of utmost importance. Therefore, the purpose of this study was to investigate what…
Descriptors: Science Instruction, Measurement Techniques, Psychometrics, Evidence
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques