NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 24 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Beheshti, Shima; Safa, Mohammad Ahmadi – Iranian Journal of Language Teaching Research, 2023
The indefinite nature of test fairness and different interpretations and definitions of the concept have stirred a lot of controversy over the years, necessitating the reconceptualization of the concept. On this basis, this study aimed to explore the empirical validity of Kunnan's (2008) Test Fairness Framework (TFF) and revisit the established…
Descriptors: Test Bias, Equal Education, Grounded Theory, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018
Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…
Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone had proposed three features to assess the quality of the distribution of the items difficulties in a test, on the so called "most probable response map": line, stack and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing an evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Wainer, Howard – Journal of Educational and Behavioral Statistics, 2010
In this essay, the author tries to look forward into the 21st century to divine three things: (i) What skills will researchers in the future need to solve the most pressing problems? (ii) What are some of the most likely candidates to be those problems? and (iii) What are some current areas of research that seem mined out and should not distract…
Descriptors: Research Skills, Researchers, Internet, Access to Information
Peer reviewed Peer reviewed
Direct linkDirect link
van Barneveld, Christina – Applied Psychological Measurement, 2007
The purpose of this study is to examine the effects of a false assumption regarding the motivation of examinees on test construction. Simulated data were generated using two models of item responses (the three-parameter logistic item response model alone and in combination with Wise's examinee persistence model) and were calibrated using a…
Descriptors: Test Construction, Item Response Theory, Models, Bayesian Statistics
Furr, Mike; Bacharach, Verne R. – SAGE Publications (CA), 2007
The authors center their presentation of material around a conceptual understanding of psychometric issues, such as validity and reliability, and on purpose rather than procedure, the "why" rather than the "how to." Their goal is to introduce psychometric principles at a level that is deeper and more focused than found in introductory…
Descriptors: Generalizability Theory, Test Bias, Research Methodology, Testing
Peer reviewed Peer reviewed
Oshima, T. C.; Raju, Nambury S. Rajo; Flowers, Claudia P. – Journal of Educational Measurement, 1997
Defines and demonstrates a framework for studying differential item functioning and differential test functioning for tests that are intended to be multidimensional. The procedure, which is illustrated with simulated data, is an extension of the unidimensional differential functioning of items and tests approach (N. Raju, W. van der Linden, and P.…
Descriptors: Item Bias, Item Response Theory, Models, Simulation
Karma, Kai – 1973
This report defines musical aptitude in order to construct a test and obtain data pertinent to the perfecting of the definition and helpful in the practical assessment of student aptitude. Criteria for assessing potential music students often reflect achievement rather than aptitude; objective tests are often too atomistic and narrow in scope or…
Descriptors: Aesthetic Education, Aptitude Tests, Models, Music
Peer reviewed Peer reviewed
Lord, Frederic M. – Journal of Educational Measurement, 1977
A variety of practical applications of item characteristic curve test theory are discussed. Among these applications are tailored testing, two stage testing, determining whether two tests measure the same latent trait, and measuring item bias towards minority or other groups. (Author/JKS)
Descriptors: Computer Programs, Latent Trait Theory, Mastery Tests, Measurement
Diamond, Esther E. – Measurement and Evaluation in Guidance, 1976
Three principal sources of sex bias in measurement are defined: the society itself, the extent to which the test content is biased, and biased use of the test results. Intervention seems necessary if equality of opportunity is to be achieved. As one type of intervention, a model is suggested. (Author)
Descriptors: Intervention, Measurement Instruments, Models, Sex Discrimination
Beins, Bernard C. – 1992
The two-part activity outlined in this paper reveals to undergraduate students that assumptions made in theory building remain unquestioned until one steps outside the initial realm of expectations, and that theories adopted have a demonstrable impact on behaviors. Part I defines a theory, describes the roles of assumptions and knowledge in…
Descriptors: Cognitive Structures, Expectation, Higher Education, Knowledge Level
Peer reviewed Peer reviewed
Kay, Patricia M. – Education and Urban Society, 1975
Note that judgments about the relationship of test items to actual job duties have generally been made as a single comparison, proposes a model for analyzing the accuracy of translations for each step of the process of test development for use in personnel administration, and suggests scaling procedures appropriate for the various judgments. (JM)
Descriptors: Court Litigation, Federal Courts, Legal Problems, Models
Green, Donald Ross; And Others – 1988
Potential benefits of using item response theory in test construction are evaluated, based on the experience and evidence accumulated during 9 years of using a three-parameter model in the construction of major achievement batteries. Specific benefits covered include obtaining sample-free item calibrations and item-free person measurement,…
Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Elementary Secondary Education
Peer reviewed Peer reviewed
Wood, Judy W.; And Others – Social Education, 1989
Focuses on adapting the construction of teacher-made social studies tests for mildly disabled mainstreamed students in grades K-12. Provides a generic model for modifying tests in order to avoid student failure due to test anxiety or the nature of the student's disability. (LS)
Descriptors: Elementary School Students, Elementary Secondary Education, Mainstreaming, Mild Disabilities
Previous Page | Next Page ยป
Pages: 1  |  2