Publication Date
In 2025 | 16 |
Since 2024 | 97 |
Since 2021 (last 5 years) | 273 |
Since 2016 (last 10 years) | 617 |
Since 2006 (last 20 years) | 1413 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 110 |
Practitioners | 107 |
Teachers | 46 |
Administrators | 25 |
Policymakers | 24 |
Counselors | 12 |
Parents | 7 |
Students | 7 |
Support Staff | 4 |
Community | 2 |
Location
California | 60 |
Canada | 60 |
United States | 56 |
Turkey | 47 |
Australia | 43 |
Florida | 34 |
Germany | 26 |
Texas | 26 |
Netherlands | 25 |
China | 24 |
Iran | 21 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Amirhossein Rasooli; Christopher DeLuca – Applied Measurement in Education, 2024
Inspired by the recent 21st century social and educational movements toward equity, diversity, and inclusion for disadvantaged groups, educational researchers have sought in conceptualizing fairness in classroom assessment contexts. These efforts have provoked promising key theoretical foundations and empirical investigations to examine fairness…
Descriptors: Test Bias, Student Evaluation, Social Justice, Equal Education
Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023
A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…
Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis
Ebru Dogruöz; Hülya Kelecioglu – International Journal of Assessment Tools in Education, 2024
In this research, multistage adaptive tests (MST) were compared according to sample size, panel pattern and module length for top-down and bottom-up test assembly methods. Within the scope of the research, data from PISA 2015 were used and simulation studies were conducted according to the parameters estimated from these data. Analysis results for…
Descriptors: Adaptive Testing, Test Construction, Foreign Countries, Achievement Tests
ETS Research Institute, 2024
ETS experts are exploring and defining the standards for responsible AI use in assessments. A comprehensive framework and principles will be unveiled in the coming months. In the meantime, this document outlines the critical areas these standards will encompass, including the principles of: (1) Fairness and bias mitigation; (2) Privacy and…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Educational Testing, Ethics
Corinne Huggins-Manley; Anthony W. Raborn; Peggy K. Jones; Ted Myers – Journal of Educational Measurement, 2024
The purpose of this study is to develop a nonparametric DIF method that (a) compares focal groups directly to the composite group that will be used to develop the reported test score scale, and (b) allows practitioners to explore for DIF related to focal groups stemming from multicategorical variables that constitute a small proportion of the…
Descriptors: Nonparametric Statistics, Test Bias, Scores, Statistical Significance
Jacklin H. Stonewall; Michael C. Dorneich; Jane Rongerude – Assessment & Evaluation in Higher Education, 2024
Peer assessment training was motivated, developed and evaluated to address fairness in higher education group learning. Team-centric pedagogies, such as team-based learning have been shown to improve engagement and learning outcomes. For many instructors using teams, peer assessments are integral for monitoring team performance and ensuring…
Descriptors: Peer Evaluation, Training, Student Attitudes, Program Effectiveness
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
Ming-Chi Tseng – Structural Equation Modeling: A Multidisciplinary Journal, 2024
The primary objective of this investigation is the formulation of random intercept latent profile transition analysis (RI-LPTA). Our simulation investigation suggests that the election between LPTA and RI-LPTA for examination has negligible impact on the estimation of transition probability parameters when the population parameters are generated…
Descriptors: Monte Carlo Methods, Predictor Variables, Research Methodology, Test Bias
Yvonne Kao; Daniel Murphy; Aleata Hubbard Cheuoua; Priya Kannan; Jennifer Tsan; Kyle E. Jennings; Heather Smith; Shameeka Emanuel; Emily R. Miller – WestEd, 2023
In spring 2022, WestEd conducted a literature review to summarize the major frameworks used in career intentions research and the evidence supporting each framework, as well as to develop an initial set of constructs to guide the development of a brief, culturally sensitive computing career intentions survey measuring individual, situational, and…
Descriptors: Career Planning, Computer Science Education, Test Bias, Self Efficacy
Craig Peck; Tiffanie Lewis-Durham – Journal of School Leadership, 2025
In this conceptual study, we examined ways in which Black educational leaders responded to the nascent rise of widespread educational accountability in the 1960s and 1970s. In conducting our research, we used history research methods in investigating contemporary historical sources, including publications by the leaders whom we examined. We also…
Descriptors: Urban Schools, Accountability, African Americans, Leadership
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Bolt, Daniel M.; Liao, Xiangyi – Journal of Educational Measurement, 2021
We revisit the empirically observed positive correlation between DIF and difficulty studied by Freedle and commonly seen in tests of verbal proficiency when comparing populations of different mean latent proficiency levels. It is shown that a positive correlation between DIF and difficulty estimates is actually an expected result (absent any true…
Descriptors: Test Bias, Difficulty Level, Correlation, Verbal Tests
Kristína Czekóová; Tomáš Urbánek – Journal of Psychoeducational Assessment, 2024
An accurate assessment of cognitive abilities in populations that differ from the majority in cultural and linguistic characteristics is one of the main challenges in cognitive testing. Previously developed methods for assessment of the validity of cognitive scores in individuals with diverse backgrounds, such as the Culture-Language…
Descriptors: Foreign Countries, Intelligence Tests, Test Validity, Abstract Reasoning
Anela Hrnjicic; Adis Alihodžic – International Electronic Journal of Mathematics Education, 2024
Understanding the concepts related to real function is essential in learning mathematics. To determine how students understand these concepts, it is necessary to have an appropriate measurement tool. In this paper, we have created a web application using 32 items from conceptual understanding of real functions (CURF) item bank. We conducted a…
Descriptors: Mathematical Concepts, College Freshmen, Foreign Countries, Computer Assisted Testing
Tien-Ling Hu; Dubravka Svetina Valdivia – Research in Higher Education, 2024
Undergraduate research, recognized as one of the High-Impact Practices (HIPs), has demonstrated a positive association with diverse student learning outcomes. Understanding the pivotal quality factors essential for its efficacy is important for enhancing student success. This study evaluates the psychometric properties of survey items employed to…
Descriptors: Undergraduate Students, Student Research, Student Experience, Psychometrics