Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 14 |
Descriptor
Evaluation Research | 15 |
Models | 15 |
Test Items | 15 |
Item Response Theory | 11 |
Comparative Analysis | 6 |
Psychometrics | 6 |
Item Analysis | 5 |
Evaluation Methods | 4 |
Factor Analysis | 4 |
Goodness of Fit | 4 |
Measurement Techniques | 3 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 13 |
Reports - Research | 7 |
Reports - Descriptive | 3 |
Opinion Papers | 2 |
Reports - Evaluative | 2 |
Books | 1 |
Dissertations/Theses -… | 1 |
Education Level
Elementary Secondary Education | 4 |
Higher Education | 3 |
Grade 4 | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Postsecondary Education | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Students | 1 |
Location
New York | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 1 |
NEO Personality Inventory | 1 |
National Assessment of… | 1 |
SAT (College Admission Test) | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis
Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Berliner, David C. – Teacher Educator, 2013
In the United States, but not only here, the movement to evaluate teachers based on student test scores has received powerful political and parental support. The logic is simple. From one testing occasion to another students should show growth in their knowledge and skill. Similar types of students should show similar patterns of growth. Those…
Descriptors: Teacher Evaluation, Merit Pay, Evaluation Problems, Models
Van Dam, Nicholas T.; Hobkirk, Andrea L.; Danoff-Burg, Sharon; Earleywine, Mitch – Assessment, 2012
Mindfulness, a construct that entails moment-to-moment effort to be aware of present experiences and positive attitudinal features, has become integrated into the sciences. The Five Facet Mindfulness Questionnaire (FFMQ), one popular measure of mindfulness, exhibits different responses to positively and negatively worded items in nonmeditating…
Descriptors: Factor Structure, Measures (Individuals), Factor Analysis, Questionnaires
Smithson, Michael; Merkle, Edgar C.; Verkuilen, Jay – Journal of Educational and Behavioral Statistics, 2011
This paper describes the application of finite-mixture general linear models based on the beta distribution to modeling response styles, polarization, anchoring, and priming effects in probability judgments. These models, in turn, enhance our capacity for explicitly testing models and theories regarding the aforementioned phenomena. The mixture…
Descriptors: Priming, Research Methodology, Probability, Item Response Theory
Chen, Tzu-An – ProQuest LLC, 2010
This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…
Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models
Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien – Educational and Psychological Measurement, 2009
This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…
Descriptors: Test Items, Measures (Individuals), Test Bias, Evaluation Research
Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection
Tormakangas, Kari – Educational Research and Evaluation, 2011
Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…
Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory
Wang, Jianjun – School Science and Mathematics, 2011
As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…
Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking
Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011
When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
Gierl, Mark J.; Wang, Changjiang; Zhou, Jiawen – Journal of Technology, Learning, and Assessment, 2008
The purpose of this study is to apply the attribute hierarchy method (AHM) to a sample of SAT algebra items administered in March 2005. The AHM is a psychometric method for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. An…
Descriptors: Test Items, Protocol Analysis, Psychometrics, Algebra
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008
Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…
Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items
Newgent, Rebecca A.; Lee, Sang Min; Higgins, Kristin K.; Mulvenon, Sean W.; Connors, Joanie V. – Journal of Educational Research & Policy Studies, 2004
The Revised NEO Personality Inventory (NEO PI-R) was developed to operationalize the Five-Factor Model of Personality. Using correlational analysis and confirmatory and exploratory factor analysis, the present study investigates the facet structure of the domain of Agreeableness of the NEO-PI-R at the facet and item level to assess which is a more…
Descriptors: Personality Traits, Personality, Factor Analysis, Evaluation Research
Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition