Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 16 |
Descriptor
Source
Author
Publication Type
Education Level
Elementary Education | 4 |
Higher Education | 3 |
Middle Schools | 3 |
Secondary Education | 3 |
Grade 5 | 2 |
High Schools | 2 |
Intermediate Grades | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Grade 11 | 1 |
Grade 8 | 1 |
More ▼ |
Location
Netherlands | 5 |
Canada | 3 |
Florida | 3 |
Texas | 3 |
Turkey | 3 |
Arizona | 2 |
Australia | 2 |
California | 2 |
Massachusetts | 2 |
United Kingdom | 2 |
Arkansas | 1 |
More ▼ |
Laws, Policies, & Programs
Education for All Handicapped… | 2 |
Civil Rights Act 1964 Title VI | 1 |
Civil Rights Act 1964 Title… | 1 |
Larry P v Riles | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Grantee Submission, 2020
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Item Response Theory, Test Bias, Test Items
Sanguras, Laila Y.; Gibson, Shavonne D.; Haqqi, Hamza S.; Torres, Angie M. – AERA Online Paper Repository, 2021
Minority studies are underrepresented in gifted and talented education programs across the nation and the methods used to identify students for advanced services may be the issue. This study examined the Scales for Identifying Gifted Students (SIGS), a set of nationally normed behavior rating scales, for the purpose of updating the instrument. The…
Descriptors: Gifted, Academically Gifted, Talent Identification, Measures (Individuals)
Gübes, Nese; Uyar, Seyma – International Journal of Progressive Education, 2020
This study aims to compare the performance of different small sample equating methods in the presence and absence of differential item functioning (DIF) in common items. In this research, Tucker linear equating, Levine linear equating, unsmoothed and pre-smoothed (C=4) chained equipercentile equating, and simplified circle arc equating methods…
Descriptors: Test Bias, Equated Scores, Test Items, Methods
Lang, David – Grantee Submission, 2019
Whether high-stakes exams such as the SAT or College Board AP exams should penalize incorrect answers is a controversial question. In this paper, we document that penalty functions can have differential effects depending on a student's risk tolerance. Moreover, literature shows that risk aversion tends to vary along other areas of concern such as…
Descriptors: High Stakes Tests, Risk, Item Response Theory, Test Bias
Geiger, Tray; Amerein-Beardsley, Audrey – AERA Online Paper Repository, 2017
The Education Value-Added Assessment System (EVAAS), the value-added model (VAM) sold by the business analytics software company SAS Institute Inc., is advertised as offering "precise, reliable and unbiased results that go far beyond what other simplistic [value-added] models found in the market today can provide." In this study, we…
Descriptors: Value Added Models, Test Validity, Test Reliability, Teacher Evaluation
Selvi, Hüseyin; Özdemir Alici, Devrim – International Journal of Assessment Tools in Education, 2018
In this study, it is aimed to investigate the impact of different missing data handling methods on the detection of Differential Item Functioning methods (Mantel Haenszel and Standardization methods based on Classical Test Theory and Likelihood Ratio Test method based on Item Response Theory). In this regard, on the data acquired from 1046…
Descriptors: Test Bias, Test Theory, Item Response Theory, Multiple Choice Tests
Luo, Xin; Reckase, Mark D.; He, Wei – AERA Online Paper Repository, 2016
While dichotomous item dominates the application of computerized adaptive testing (CAT), polytomous item and set-based item hold promises for being incorporated in CAT. However, how to assemble a CAT containing mixed item formats is challenging. This study investigated: (1) how the mixed CAT works compared with the dichotomous-item-based CAT; (2)…
Descriptors: Test Items, Test Format, Computer Assisted Testing, Adaptive Testing
Raddatz, Mikaela M.; Royal, Kenneth D.; Pennington, Jessica – Online Submission, 2012
The purpose of this study is to determine if the construct of a medical subspecialty examination, as defined by the hierarchy of item difficulties, is stable across physicians who completed a fellowship and recertifiers as compared to non-fellows. Three comparisons of groups are made: 1) Practice pathway board candidates compared to members of all…
Descriptors: Evidence, Fellowships, Board Candidates, Test Bias
Kachchaf, Rachel; Noble, Tracy; Rosebery, Ann; Wang, Yang; Warren, Beth; O'Connor, Mary Catherine – Grantee Submission, 2014
Most research on linguistic features of test items negatively impacting English language learners' (ELLs') performance has focused on lexical and syntactic features, rather than discourse features that operate at the level of the whole item. This mixed-methods study identified two discourse features in 162 multiple-choice items on a standardized…
Descriptors: English Language Learners, Science Tests, Test Items, Discourse Analysis
Carmichael, Colin – Mathematics Education Research Group of Australasia, 2013
With reports of declining enrolments in mathematics related degrees and low female participation rates in these degrees, the issue of gender differences in mathematics remains relevant. Results of recent studies suggest gender differences in mathematics are nuanced and that small differences in the early years can manifest as larger differences in…
Descriptors: Gender Differences, Mathematics Achievement, Longitudinal Studies, Foreign Countries
Zoanetti, Nathan; Les, Magdalena; Leigh-Lancaster, David – Mathematics Education Research Group of Australasia, 2014
From 2011-2013 the VCAA conducted a trial aligning the use of computers in curriculum, pedagogy and assessment culminating in a group of 62 volunteer students sitting their end of Year 12 technology-active Mathematical Methods (CAS) Examination 2 as a computer-based examination. This paper reports on statistical modelling undertaken to compare the…
Descriptors: Computer Assisted Testing, Comparative Analysis, Mathematical Concepts, Mathematics Tests
Noble, Tracy; Kachchaf, Rachel; Rosebery, Ann; Warren, Beth; O'Connor, Mary Catherine; Wang, Yang – Grantee Submission, 2014
Little research has examined individual linguistic features that influence English language learners (ELLs) test performance. Furthermore, research has yet to explore the relationship between the science strand of test items and the types of linguistic features the items include. Utilizing Differential Item Functioning, this study examines ELL…
Descriptors: Science Tests, English Language Learners, Linguistics, Test Items
Reshetar, Rosemary; Melican, Gerald J. – College Board, 2010
This paper discusses issues related to the design and psychometric work for mixed-format tests --tests containing both multiple-choice (MC) and constructed-response (CR) items. The issues of validity, fairness, reliability and score consistency can be addressed but for mixed-format tests there are many decisions to be made and no examination or…
Descriptors: Psychometrics, Test Construction, Multiple Choice Tests, Test Items
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Green, Donald Ross – 1979
Sources of test bias are discussed and steps to prevent or reduce bias in tests are listed. Test bias can occur because of the way test materials are written, the conditions of administration, and the interpretations given the results. Steps to prevent or reduce bias arising in the test development process include: (1) using heterogeneous sets of…
Descriptors: Educational Testing, Test Bias, Test Construction, Test Interpretation