ERIC - Search Results

Showing all 9 results

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

IRT Item Parameter Scaling for Developing New Item Pools

Peer reviewed

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017

The increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. Three scaling procedures are considered: (a) concurrent…

Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

On the Cross-Country Comparability of Indicators of Socioeconomic Resources in PISA

Peer reviewed

Pokropek, Artur; Borgonovi, Francesca; McCormick, Carina – Applied Measurement in Education, 2017

Large-scale international assessments rely on indicators of the resources that students report having in their homes to capture the financial capital of their families. The scaling methodology currently used to develop the Programme for International Student Assessment (PISA) background indices is designed to maximize within-country comparability…

Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using classical test theory (CTT) versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Vertical Scaling with the Rasch Model Utilizing Default and Tight Convergence Settings with WINSTEPS and BILOG-MG

Peer reviewed

Custer, Michael; Omar, Md Hafidz; Pomplun, Mark – Applied Measurement in Education, 2006

This study compared vertical scaling results for the Rasch model from BILOG-MG and WINSTEPS. The item and ability parameters for the simulated vocabulary tests were scaled across 11 grades, kindergarten through 10th. Data were based on real data and were simulated under normal and skewed distribution assumptions. WINSTEPS and BILOG-MG were each…

Descriptors: Models, Scaling, Computer Software, Vocabulary

A Simulation Comparison of Parametric and Nonparametric Dimensionality Detection Procedures

Peer reviewed

Mroch, Andrew A.; Bolt, Daniel M. – Applied Measurement in Education, 2006

Recently, nonparametric methods have been proposed that provide a dimensionally based description of test structure for tests with dichotomous items. Because such methods are based on different notions of dimensionality than are assumed when using a psychometric model, it remains unclear whether these procedures might lead to a different…

Descriptors: Simulation, Comparative Analysis, Psychometrics, Methods Research

Mokken Scale Analysis: Theoretical Considerations and an Application to Transitivity Tasks.

Peer reviewed

Sijtsma, Klaas; Verweij, Anton C. – Applied Measurement in Education, 1992

Empirical data analysis using the Mokken models of monotone homogeneity and double monotonicity is discussed. Results from the Mokken approach with 3 data sets (for a total of 425 elementary school students) pertaining to transitive inference items are compared to Rasch analysis. (SLD)

Descriptors: Comparative Analysis, Elementary Education, Elementary School Students, Item Response Theory

Measuring Self-Efficacy: Multitrait-Multimethod Comparison of Scaling Procedures.

Peer reviewed

Bong, Mimi; Hocevar, Dennis – Applied Measurement in Education, 2002

Examined convergent and discriminant validity of various self-efficacy measures across two studies, one involving 358 U.S. high school students and another involving 235 Korean female high school students. Across the studies the first-order confirmatory factor analyses provide support for both convergent validity of different self-efficacy…

Descriptors: Comparative Analysis, Foreign Countries, High School Students, High Schools

Cluster Analysis of Angular Data in Applications of Multidimensional Item-Response Theory.

Peer reviewed

Miller, Timothy R.; Hirsch, Thomas M. – Applied Measurement in Education, 1992

A procedure for interpreting multiple-discrimination indices from a multidimensional item-response theory analysis is described and demonstrated with responses of 1,635 high school students to a multiple-choice test. The procedure consists of converting discrimination parameter estimates to direction cosines and analyzing the angular distances…

Descriptors: Ability, Cluster Analysis, Comparative Analysis, Estimation (Mathematics)
