Publication Date
In 2025 | 3 |
Since 2024 | 15 |
Since 2021 (last 5 years) | 32 |
Since 2016 (last 10 years) | 68 |
Since 2006 (last 20 years) | 199 |
Descriptor
Evaluation Methods | 413 |
Test Bias | 413 |
Student Evaluation | 157 |
Test Validity | 104 |
Test Items | 85 |
Elementary Secondary Education | 83 |
Test Reliability | 54 |
Standardized Tests | 53 |
Foreign Countries | 52 |
Test Construction | 51 |
Testing Problems | 48 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 40 |
Elementary Secondary Education | 39 |
Postsecondary Education | 21 |
Elementary Education | 19 |
Secondary Education | 15 |
Grade 4 | 11 |
Intermediate Grades | 8 |
Grade 8 | 7 |
High Schools | 7 |
Middle Schools | 6 |
Grade 10 | 5 |
More ▼ |
Audience
Practitioners | 22 |
Teachers | 13 |
Researchers | 11 |
Administrators | 9 |
Policymakers | 4 |
Support Staff | 3 |
Counselors | 2 |
Community | 1 |
Parents | 1 |
Location
Canada | 9 |
United States | 7 |
California | 5 |
United Kingdom | 5 |
Australia | 4 |
Arizona | 3 |
Florida | 3 |
Minnesota | 3 |
United Kingdom (England) | 3 |
Alaska | 2 |
Germany | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Corinne Huggins-Manley; Anthony W. Raborn; Peggy K. Jones; Ted Myers – Journal of Educational Measurement, 2024
The purpose of this study is to develop a nonparametric DIF method that (a) compares focal groups directly to the composite group that will be used to develop the reported test score scale, and (b) allows practitioners to explore for DIF related to focal groups stemming from multicategorical variables that constitute a small proportion of the…
Descriptors: Nonparametric Statistics, Test Bias, Scores, Statistical Significance
Jacklin H. Stonewall; Michael C. Dorneich; Jane Rongerude – Assessment & Evaluation in Higher Education, 2024
Peer assessment training was motivated, developed and evaluated to address fairness in higher education group learning. Team-centric pedagogies, such as team-based learning have been shown to improve engagement and learning outcomes. For many instructors using teams, peer assessments are integral for monitoring team performance and ensuring…
Descriptors: Peer Evaluation, Training, Student Attitudes, Program Effectiveness
Ming-Chi Tseng – Structural Equation Modeling: A Multidisciplinary Journal, 2024
The primary objective of this investigation is the formulation of random intercept latent profile transition analysis (RI-LPTA). Our simulation investigation suggests that the election between LPTA and RI-LPTA for examination has negligible impact on the estimation of transition probability parameters when the population parameters are generated…
Descriptors: Monte Carlo Methods, Predictor Variables, Research Methodology, Test Bias
Mohammad Ahmadi Safa; Bahare Nasiri – Language Testing in Asia, 2025
Studies have confirmed that fair assessment practices in educational contexts affect learners' motivation, self-regulation, and above all teacher credibility, yet the concept has been subject to educational stakeholders' diverse outlooks and perspectives. On this basis, this study delves into the high school English as Foreign Language (EFL)…
Descriptors: Student Evaluation, High School Teachers, Evaluation Methods, English (Second Language)
Blair Lehman; Jesse R. Sparks; Diego Zapata-Rivera; Jonathan Steinberg; Carol Forsyth – Practical Assessment, Research & Evaluation, 2024
Most assessments adopt a one-size-fits-all approach to provide fair testing opportunities to all learners. However, this rigid approach to assessment may limit the ability for some learners to show what they know and can do. The Caring Assessments framework proposed a guide for the design and development of flexible, personalized, and adaptive…
Descriptors: Alternative Assessment, Evaluation Methods, Student Evaluation, Culturally Relevant Education
Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…
Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy
Douglas B. Petersen; Alisa Konishi-Therkildsen; Kallie Dawn Clark; Anahi Kamila DeRobles; Ashley Elizabeth Frahm; Kristi Jones; Camryn Lettich; Trina D. Spencer – Journal of Speech, Language, and Hearing Research, 2024
Purpose: Several studies have demonstrated that dynamic assessment can be a less biased, valid approach for the identification of language disorder among diverse school-age children. However, all prior studies have included a relatively small number of participants, which is generally not adequate for psychometric research. This is the first…
Descriptors: Elementary School Students, Language Impairments, Language Usage, Individual Characteristics
Carmen Köhler; Lale Khorramdel; Artur Pokropek; Johannes Hartig – Journal of Educational Measurement, 2024
For assessment scales applied to different groups (e.g., students from different states; patients in different countries), multigroup differential item functioning (MG-DIF) needs to be evaluated in order to ensure that respondents with the same trait level but from different groups have equal response probabilities on a particular item. The…
Descriptors: Measures (Individuals), Test Bias, Models, Item Response Theory
Phillips, Gregory, II; Felt, Dylan; Perez-Bill, Esrea; Ruprecht, Megan M.; Glenn, Erik Elías; Lindeman, Peter; Miller, Robin Lin – American Journal of Evaluation, 2023
Lesbian, gay, bisexual, transgender, queer, intersex, Two-Spirit, and other sexual and gender minority (LGBTQ+) individuals encounter numerous obstacles to equity across health and healthcare, education, housing, employment, and other domains. Such barriers are even greater for LGBTQ+ individuals who are also Black, Indigenous, and People of Color…
Descriptors: Student Evaluation, LGBTQ People, Test Bias, Barriers
Ahmad Suryadi; Sahal Fawaiz; Eka Kurniati; Ahmad Swandi – Journal of Pedagogical Research, 2024
The waning interest of students in science became a global concern. The purpose of this research was to translate, adapt, and validate the My Attitude toward Science [MATS] questionnaire instrument, which was used to measure students' attitudes toward science in the Indonesian context. We also investigated the items that contributed to gender and…
Descriptors: Foreign Countries, Science Education, Achievement Tests, Secondary School Students
Esteban Guevara Hidalgo – International Journal for Educational Integrity, 2025
The COVID-19 pandemic had a profound impact on education, forcing many teachers and students who were not used to online education to adapt to an unanticipated reality by improvising new teaching and learning methods. Within the realm of virtual education, the evaluation methods underwent a transformation, with some assessments shifting towards…
Descriptors: Foreign Countries, Higher Education, COVID-19, Pandemics
Valentine, Nyoli; Durning, Steven; Shanahan, Ernst Michael; Schuwirth, Lambert – Advances in Health Sciences Education, 2021
Human judgement is widely used in workplace-based assessment despite criticism that it does not meet standards of objectivity. There is an ongoing push within the literature to better embrace subjective human judgement in assessment not as a 'problem' to be corrected psychometrically but as legitimate perceptions of performance. Taking a step back…
Descriptors: Justice, Literature Reviews, Evaluation Methods, Test Bias
Child, Simon; Ellis, Paul – SAGE Publications Ltd (UK), 2021
How do teachers develop their understanding of the foundation principles of assessment, stay up to date with the latest classroom approaches and have the confidence to evaluate and question the effectiveness of new methods? This professional resource for teachers supports them to understand the what, why and how of assessment. It provides key…
Descriptors: Assessment Literacy, Student Evaluation, Evaluation Methods, Self Efficacy
Nisbet, Isabel; Shaw, Stuart – Assessment in Education: Principles, Policy & Practice, 2022
Fairness in assessment has become increasingly topical and controversial in recent years. Assessment theoreticians are writing more about fairness and assessment practitioners have developed processes and good practice to minimise unfairness. There is also increased scrutiny by students, parents and the wider public--not only of the fairness of…
Descriptors: High Stakes Tests, Test Bias, COVID-19, Pandemics
Meyer, J. Patrick; Dahlin, Michael – NWEA, 2022
The MAP® Growth™ theory of action describes key features of MAP Growth and its position in a comprehensive assessment system. The basic premise of the theory of action is that all students learn when MAP Growth is situated in a comprehensive assessment system and used for its intended purposes to yield information about student learning and enable…
Descriptors: Achievement Tests, Academic Achievement, Achievement Gains, Student Evaluation