ERIC - Search Results

Showing all 15 results

Parameter Estimation in Rasch Models for Examinee-Selected Items

Peer reviewed

Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017

The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Problems with Value-Added Evaluations of Teachers? Let Me Count the Ways!

Peer reviewed

Berliner, David C. – Teacher Educator, 2013

In the United States, but not only here, the movement to evaluate teachers based on student test scores has received powerful political and parental support. The logic is simple. From one testing occasion to another students should show growth in their knowledge and skill. Similar types of students should show similar patterns of growth. Those…

Descriptors: Teacher Evaluation, Merit Pay, Evaluation Problems, Models

Mind Your Words: Positive and Negative Items Create Method Effects on the Five Facet Mindfulness Questionnaire

Peer reviewed

Van Dam, Nicholas T.; Hobkirk, Andrea L.; Danoff-Burg, Sharon; Earleywine, Mitch – Assessment, 2012

Mindfulness, a construct that entails moment-to-moment effort to be aware of present experiences and positive attitudinal features, has become integrated into the sciences. The Five Facet Mindfulness Questionnaire (FFMQ), one popular measure of mindfulness, exhibits different responses to positively and negatively worded items in nonmeditating…

Descriptors: Factor Structure, Measures (Individuals), Factor Analysis, Questionnaires

Beta Regression Finite Mixture Models of Polarization and Priming

Peer reviewed

Smithson, Michael; Merkle, Edgar C.; Verkuilen, Jay – Journal of Educational and Behavioral Statistics, 2011

This paper describes the application of finite-mixture general linear models based on the beta distribution to modeling response styles, polarization, anchoring, and priming effects in probability judgments. These models, in turn, enhance our capacity for explicitly testing models and theories regarding the aforementioned phenomena. The mixture…

Descriptors: Priming, Research Methodology, Probability, Item Response Theory

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

The MIMIC Method with Scale Purification for Detecting Differential Item Functioning

Peer reviewed

Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien – Educational and Psychological Measurement, 2009

This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…

Descriptors: Test Items, Measures (Individuals), Test Bias, Evaluation Research

Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection

Peer reviewed

Tormakangas, Kari – Educational Research and Evaluation, 2011

Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…

Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory

Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

Peer reviewed

Wang, Jianjun – School Science and Mathematics, 2011

As the largest international study ever undertaken, the Trends in International Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking

The Analysis of Measurement Equivalence in International Studies Using the Rasch Model

Peer reviewed

Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011

When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…

Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education

Handbook of Polytomous Item Response Theory Models

Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010

This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…

Descriptors: Guides, Item Response Theory, Test Items, Correlation

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees' Cognitive Skills in Algebra on the SAT

Peer reviewed

Gierl, Mark J.; Wang, Changjiang; Zhou, Jiawen – Journal of Technology, Learning, and Assessment, 2008

The purpose of this study is to apply the attribute hierarchy method (AHM) to a sample of SAT algebra items administered in March 2005. The AHM is a psychometric method for classifying examinees' test item responses into a set of structured attribute patterns associated with different components from a cognitive model of task performance. An…

Descriptors: Test Items, Protocol Analysis, Psychometrics, Algebra

Linking for the General Diagnostic Model. Research Report. ETS RR-08-08

Peer reviewed

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…

Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items

The Construct of Agreeableness: Facet vs. Item Level Analysis

Peer reviewed

Newgent, Rebecca A.; Lee, Sang Min; Higgins, Kristin K.; Mulvenon, Sean W.; Connors, Joanie V. – Journal of Educational Research & Policy Studies, 2004

The Revised NEO Personality Inventory (NEO PI-R) was developed to operationalize the Five-Factor Model of Personality. Using correlational analysis and confirmatory and exploratory factor analysis, the present study investigates the facet structure of the domain of Agreeableness of the NEO PI-R at the facet and item level to assess which is a more…

Descriptors: Personality Traits, Personality, Factor Analysis, Evaluation Research

Item Difficulty Modeling of Paragraph Comprehension Items

Peer reviewed

Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006

Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…

Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition
