Publication Date
| In 2026 | 0 |
| Since 2025 | 40 |
| Since 2022 (last 5 years) | 227 |
| Since 2017 (last 10 years) | 572 |
| Since 2007 (last 20 years) | 1379 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Brown, Kevin – CEA Forum, 2015
In this article, the author describes his project to take every standardized exam English majors students take. During the summer and fall semesters of 2012, the author signed up for and took the GRE General Test, the Praxis Content Area Exam (English Language, Literature, and Composition: Content Knowledge), the Senior Major Field Tests in…
Descriptors: College Faculty, College English, Test Preparation, Standardized Tests
Ferjencík, Ján; Slavkovská, Miriam; Kresila, Juraj – Journal of Pedagogy, 2015
The paper reports on the adaptation of a D-KEFS test battery for Slovakia. Drawing on concrete examples, it describes and illustrates the key issues relating to the transfer of test items from one socio-cultural environment to another. The standardisation sample of the population of Slovak pupils in the fourth year of primary school included 250…
Descriptors: Executive Function, Foreign Countries, Test Construction, Test Items
Bornstein, Robert F. – Psychological Assessment, 2011
Although definitions of validity have evolved considerably since L. J. Cronbach and P. E. Meehl's classic (1955) review, contemporary validity research continues to emphasize correlational analyses assessing predictor-criterion relationships, with most outcome criteria being self-reports. The present article describes an alternative way of…
Descriptors: Test Validity, Scores, Models, Psychological Evaluation
Facon, Bruno; Magis, David; Belmont, John M. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
The matching of groups is a traditional way to control for confounding variables in developmental disabilities research. The equivalency of means across groups is routinely checked for these variables, but not the homogeneity of their variances or the shapes of their distributions. In the present paper, it is argued that group matching can go…
Descriptors: Test Bias, Developmental Disabilities, Scores, Statistical Analysis
Cai, Li; Yang, Ji Seung; Hansen, Mark – Psychological Methods, 2011
Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single-group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of…
Descriptors: Item Analysis, Item Response Theory, Factor Analysis, Maximum Likelihood Statistics
Magis, David; De Boeck, Paul – Multivariate Behavioral Research, 2011
We focus on the identification of differential item functioning (DIF) when more than two groups of examinees are considered. We propose to consider items as elements of a multivariate space, where DIF items are outlying elements. Following this approach, the situation of multiple groups is a quite natural case. A robust statistics technique is…
Descriptors: Test Bias, Mathematics Tests, Identification, Sampling
Aryadoust, Vahid; Goh, Christine C. M.; Kim, Lee Ong – Language Assessment Quarterly, 2011
Differential item functioning (DIF) analysis is a way of determining whether test items function differently across subgroups of test takers after controlling for ability level. DIF results are used to evaluate tests' validity arguments. This study uses Rasch measurement to examine the Michigan English Language Assessment Battery listening test…
Descriptors: Test Items, Listening Comprehension Tests, Test Bias, Test Validity
Acar, Tulin – Educational Sciences: Theory and Practice, 2011
The purpose of this study is to examine the number of DIF items detected by HGLM at different sample sizes. Eight different sized data files have been composed. The population of the study is 798307 students who had taken the 2006 OKS Examination. 10727 students of 798307 are chosen by random sampling method as the sample of the study. Turkish,…
Descriptors: Test Bias, Sample Size, Regression (Statistics), Models
Wang, Wen-Chung; Shih, Ching-Lin – Applied Psychological Measurement, 2010
Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…
Descriptors: Methods, Test Bias, Test Items, Error of Measurement
Paek, Insu – Applied Psychological Measurement, 2010
Conservative bias in rejection of a null hypothesis from using the continuity correction in the Mantel-Haenszel (MH) procedure was examined through simulation in a differential item functioning (DIF) investigation context in which statistical testing uses a prespecified level [alpha] for the decision on an item with respect to DIF. The standard MH…
Descriptors: Test Bias, Statistical Analysis, Sample Size, Error of Measurement
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement
Chaiyapornpattana, Niorn; Wongwanich, Suwimon – Research in Higher Education Journal, 2013
This study designed 1) to develop a multidimensional thinking styles scale based on theory of mental self-government for sixth grade student 2) to investigate quality of the developed scale 3) to study profile of styles of sixth grade student and a relation of profile of styles of student in each dimension and background of gender and grade with…
Descriptors: Multidimensional Scaling, Grade 6, Cognitive Style, Mental Health
Wasanga, Paul; Somerset, Anthony – Assessment in Education: Principles, Policy & Practice, 2013
This paper starts with an analysis of the twin purposes of an examination reform programme originally launched in Kenya during the 1970s: first, to broaden the spectrum of cognitive skills being tested; and second, to set up a feedback system based on the performance profiles, providing schools and teachers with guidance as to how pedagogy and…
Descriptors: Foreign Countries, Exit Examinations, Change, Test Construction
Leong, Samuel; Qiu, Xue-Lan – Educational Psychology, 2013
Having accurate insights of teachers' conceptions of creativity and the role of assessment in arts education would inform education policy, training programmes and the measurement of learning outcomes. Yet no study has been found to examine the relationship between teachers' conceptions of creativity and their conceptions of assessment in arts…
Descriptors: Foreign Countries, Creativity, Art Education, Teacher Attitudes
Dodeen, Hamzeh – Educational Assessment, 2013
Students' opinions continue to be a significant factor in the evaluation of teaching in higher education institutions. The purpose of this study was to psychometrically assess short students evaluation of teaching (SET) forms using the UAE University form as a model. The study evaluated the form validity, reliability, the overall question, and…
Descriptors: Foreign Countries, Student Evaluation of Teacher Performance, Test Validity, Test Reliability

Peer reviewed
Direct link
