Publication Date
| In 2026 | 0 |
| Since 2025 | 38 |
| Since 2022 (last 5 years) | 225 |
| Since 2017 (last 10 years) | 570 |
| Since 2007 (last 20 years) | 1377 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Gungor, Metehan; Atalay Kabasakal, Kubra – International Journal of Assessment Tools in Education, 2020
Measurement invariance analyses are carried out in order to find evidence for the structural validity of the measurement tools used in the field of educational sciences and psychology. The purpose of this research is to examine the measurement invariance of Science Motivation and Self-Efficacy Model constructed by Instrumental Motivation to Learn…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, Gender Differences
Fikis, David R. J.; Oshima, T. C. – Educational and Psychological Measurement, 2017
Purification of the test has been a well-accepted procedure in enhancing the performance of tests for differential item functioning (DIF). As defined by Lord, purification requires reestimation of ability parameters after removing DIF items before conducting the final DIF analysis. IRTPRO 3 is a recently updated program for analyses in item…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Computer Software
Yildirim, Ozen – International Education Studies, 2019
The measurement tool not measuring the specific construct has a validity problem. Individuals based on the results obtained from this type of tool should not be evaluated. The purpose of this study was to examine the differentiated item functioning and item bias of mathematics items in the Programme for International Student Achievement 2012…
Descriptors: Gender Differences, Mathematics Tests, Test Bias, Achievement Tests
Bundsgaard, Jeppe – Large-scale Assessments in Education, 2019
International large-scale assessments like international computer and information literacy study (ICILS) (Fraillon et al. in International Association for the Evaluation of Educational Achievement (IEA), 2015) provide important empirically-based knowledge through the proficiency scales, of what characterizes tasks at different difficulty levels,…
Descriptors: Test Bias, International Assessment, Test Items, Difficulty Level
Siegfried, Christin; Wuttke, Eveline – Citizenship, Social and Economics Education, 2019
Due to their test economy and objective evaluability, multiple-choice items are used much more frequently to test knowledge than constructed-response questions. However, studies point out that dependencies may exist between the individual test result and the test format (multiple-choice or constructed-response). Studies testing economic knowledge…
Descriptors: Multiple Choice Tests, Test Bias, Sex Fairness, Gender Differences
Baris Pekmezci, Fulya; Gulleroglu, H. Deniz – Eurasian Journal of Educational Research, 2019
Purpose: This study aims to investigate the orthogonality assumption, which restricts the use of Bifactor item response theory under different conditions. Method: Data of the study have been obtained in accordance with the Bifactor model. It has been produced in accordance with two different models (Model 1 and Model 2) in a simulated way.…
Descriptors: Item Response Theory, Accuracy, Item Analysis, Correlation
Davis, Derrick D. – Alabama Journal of Educational Leadership, 2021
Without question, faculty (regardless of discipline) should be equipped with the necessary skills to assess students fairly and ethically. This study focuses on the central and prevailing importance of faculty judgment and how that judgment (or lack thereof) influences perceptions related to ethics and assessment of students. The study outlines…
Descriptors: Student Evaluation, Evaluative Thinking, Elementary School Teachers, Secondary School Teachers
Stovicek, Thomas W. – Applied Language Learning, 2021
Recent empirical research in sociolinguistics and social psychology has established the existence of the socio-psychological phenomena known as linguistic stereotyping (LS) and reverse linguistic stereotyping (RLS), which have an implicit or unconscious effect on listeners' perception of speech and speakers. Despite such findings, little research…
Descriptors: Stereotypes, Sociolinguistics, Oral Language, Language Proficiency
Moghadam, M.; Nasirzadeh, F. – Language Testing in Asia, 2020
The present study tries to investigate the fairness of an English reading comprehension test employing Kunnan's (2004) test fairness framework (TFF) as the most comprehensive model available for test fairness. The participants of this study comprised 300 freshman students taking general English course chosen based on the availability sampling,…
Descriptors: Test Bias, Reading Tests, Reading Comprehension, Test Items
Nguyen, Tutrang; Reich, Stephanie M.; Jenkins, Jade Marcus; Abedi, Jamal – Journal of Psychoeducational Assessment, 2020
This study reports an independent investigation of the psychometric properties of Desired Results Developmental Profile (DRDP), a teacher-rated measure of school readiness for preschool-aged children. In a sample of 2,031 low-income, 3- to 5-year-old children attending Head Start, we tested three measurement models: a higher order one-factor…
Descriptors: School Readiness, Measures (Individuals), Preschool Children, Test Validity
El Masri, Yasmine H.; Andrich, David – Applied Measurement in Education, 2020
In large-scale educational assessments, it is generally required that tests are composed of items that function invariantly across the groups to be compared. Despite efforts to ensure invariance in the item construction phase, for a range of reasons (including the security of items) it is often necessary to account for differential item…
Descriptors: Models, Goodness of Fit, Test Validity, Achievement Tests
Roberson, Nathan D.; Zumbo, Bruno D. – International Journal of Testing, 2019
This paper investigates measurement invariance as it relates to migration background using the Program for International Student Assessment measure of social belonging. We explore how the use of two measurement invariance techniques provide insights into differential item functioning using the alignment method in conjunction with logistic…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Woods, Kevin; McCaldin, Tamsin; Hipkiss, Amanda; Tyrrell, Beverley; Dawes, Megan – Oxford Review of Education, 2019
This paper presents a novel explanation for the continued absence of a children's rights strategy within high-stakes educational assessment with reference to the competing purposes of high-stakes assessments and group-based constructions of fairness in assessment. We provide an original critique of group-based perspectives on the validity of…
Descriptors: Childrens Rights, Student Rights, Student Evaluation, High Stakes Tests
Lang, David – Grantee Submission, 2019
Whether high-stakes exams such as the SAT or College Board AP exams should penalize incorrect answers is a controversial question. In this paper, we document that penalty functions can have differential effects depending on a student's risk tolerance. Moreover, literature shows that risk aversion tends to vary along other areas of concern such as…
Descriptors: High Stakes Tests, Risk, Item Response Theory, Test Bias
Fager, Meghan L. – ProQuest LLC, 2019
Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…
Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation

Peer reviewed
Direct link
