Publication Date
| In 2026 | 0 |
| Since 2025 | 38 |
| Since 2022 (last 5 years) | 225 |
| Since 2017 (last 10 years) | 570 |
| Since 2007 (last 20 years) | 1377 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Ögretmen, Tuncay – Educational Research and Reviews, 2015
The purpose of this study is to carry out differential item functioning (DIF) analysis for content areas of a reading comprehension subtest using four area indices within Item Response Theory (IRT) framework. The differences in the magnitudes of the area indices were compared based on the subject areas. The DIF analysis was carried out across…
Descriptors: Foreign Countries, Test Bias, Gender Differences, Reading Comprehension
Ercikan, Kadriye; Chen, Michelle Y.; Lyons-Thomas, Juliette; Goodrich, Shawna; Sandilands, Debra; Roth, Wolff-Michael; Simon, Marielle – International Journal of Testing, 2015
The purpose of this research is to examine the comparability of mathematics and science scores for students from English language backgrounds (ELB) and non-English language backgrounds (NELB). We examine the relationship between English reading proficiency and performance on mathematics and science assessments in Australia, Canada, the United…
Descriptors: Scores, Mathematics Tests, Science Tests, Native Speakers
Lei, Pui-Wa; Li, Hongli – Applied Psychological Measurement, 2013
Minimum sample sizes of about 200 to 250 per group are often recommended for differential item functioning (DIF) analyses. However, there are times when sample sizes for one or both groups of interest are smaller than 200 due to practical constraints. This study attempts to examine the performance of Simultaneous Item Bias Test (SIBTEST),…
Descriptors: Sample Size, Test Bias, Computation, Accuracy
Crowder, Marisa K.; Gordon, Rachel A.; Brown, Randal D.; Davidson, Laura A.; Domitrovich, Celene E. – School Psychology, 2019
Growing interest in understanding the role of students' social-emotional competence for school success necessitates valid measures for large-scale use. We provide validity evidence for the 40-item Washoe County School District Social-Emotional Competency Assessment (WCSD-SECA), a student self-report measure that came from a researcher-practitioner…
Descriptors: Interpersonal Competence, Test Validity, Measurement Techniques, Social Development
Pichardo, Blanca – Journal of the American Academy of Special Education Professionals, 2014
Limited research has been accomplished within the past few years regarding issues and concerns of assessment for English Language Learners (ELL) with Learning Disabilities (LD). The increasing number of this unique population throughout schools has raised many concerns for professionals in education. English Language Learners with Learning…
Descriptors: English Language Learners, Learning Disabilities, Teacher Competencies, Special Education
Mylonas, Kostas; Furnham, Adrian; Divale, William; Leblebici, Cigdem; Gondim, Sonia; Moniz, Angela; Grad, Hector; Alvaro, Jose Luis; Cretu, Romeo Zeno; Filus, Ania; Boski, Pawel – Educational and Psychological Measurement, 2014
Several sources of bias can plague research data and individual assessment. When cultural groups are considered, across or even within countries, it is essential that the constructs assessed and evaluated are as free as possible from any source of bias and specifically from bias caused due to culturally specific characteristics. Employing the…
Descriptors: Test Bias, Measures (Individuals), Unemployment, Adults
Jade Caines; Beatrice L. Bridglall; Madhabi Chatterji – Quality Assurance in Education: An International Perspective, 2014
Purpose: This policy brief discusses validity and fairness issues that could arise when test-based information is used for making "high stakes" decisions at an individual level, such as, for the certification of teachers or other professionals, or when admitting students into higher education programs and colleges, or for making…
Descriptors: Test Construction, Test Validity, High Stakes Tests, Measures (Individuals)
Kornilov, Sergey A.; Kornilova, Tatiana V.; Grigorenko, Elena L. – New Directions for Child and Adolescent Development, 2016
Unlike intelligence, creativity has rarely been investigated from the standpoint of cross-cultural invariance of the structure of the instruments used to measure it. In the study reported in this article, we investigated the cross-cultural invariance of expert ratings of creative stories written by undergraduate students from the Russian…
Descriptors: Creative Writing, Cross Cultural Studies, Case Studies, Undergraduate Students
Geiser, Saul – Center for Studies in Higher Education, 2016
The SAT is used for two purposes at the University of California. First is "eligibility": Determining whether applicants meet the minimum requirements for admission to the UC system. Second is "admissions selection": At high-demand campuses such as Berkeley, with many more eligible applicants than places available, test scores…
Descriptors: College Entrance Examinations, Eligibility, Selective Admission, Scores
Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018
We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify its possible sources by item type, content, and wording. DIF was…
Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory
Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017
Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…
Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment
Dahlke, Katie; Yang, Rui; Martínez, Carmen; Chavez, Suzette; Martin, Alejandra; Hawkinson, Laura; Shields, Joseph; Garland, Marshall; Carle, Jill – Regional Educational Laboratory Southwest, 2017
The New Mexico Public Education Department developed the Kindergarten Observation Tool (KOT) as a multidimensional observational measure of students' knowledge and skills at kindergarten entry. The primary purpose of the KOT is to inform instruction, so that kindergarten teachers can use the information about their students' knowledge and skills…
Descriptors: Test Validity, Observation, Measures (Individuals), Kindergarten
Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015
Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…
Descriptors: Test Bias, Item Response Theory, Test Items, Simulation
Morey, Melissa E.; Arora, Prerna; Stark, Kevin D. – Psychology in the Schools, 2015
Schools present a unique environment in which to conduct universal screenings for youth depression. The present study examines the efficiency of a multiple-stage assessment procedure assessing youth depression in the schools by calculating hit rates and establishing diagnostic accuracy for the measures used. Girls (N = 3318) aged 8 to 13,…
Descriptors: Depression (Psychology), Psychological Evaluation, Children, Adolescents
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2015
The present study examined measurement invariance across gender and gender differences on two measures of test anxiety developed for U.S. middle and high school, and college students. It was hypothesized that measurement invariance and gender differences would be found on the two measures of test anxiety, suggesting no separate scoring system is…
Descriptors: Test Anxiety, Affective Measures, Gender Differences, Test Bias

Peer reviewed
Direct link
