Showing 1 to 15 of 18 results
Peer reviewed
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Peer reviewed
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2022
This paper develops models to measure growth in student achievement with a focus on the possibility of differential growth in achievement for low and high-achieving students. We consider a gap-closing model that evaluates the degree to which students in a target group -- students in the bottom quartile of measured achievement -- perform better…
Descriptors: Academic Achievement, Achievement Gap, Models, Measurement Techniques
Shear, Benjamin R.; Reardon, Sean F. – Stanford Center for Education Policy Analysis, 2019
This paper describes a method for pooling grouped, ordered-categorical data across multiple waves to improve small-sample heteroskedastic ordered probit (HETOP) estimates of latent distributional parameters. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in each of a small…
Descriptors: Computation, Scores, Statistical Distributions, Sample Size
Peer reviewed
Zieger, Laura Raffaella; Jerrim, J.; Anders, J.; Shure, N. – Assessment in Education: Principles, Policy & Practice, 2022
The OECD's Programme for International Student Assessment (PISA) has become one of the key studies for evidence-based education policymaking across the globe. PISA has however received a lot of methodological criticism, including how the test scores are created. The aim of this paper is to investigate the so-called 'conditioning model', where…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020
Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…
Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement
Yanan Feng – ProQuest LLC, 2021
This dissertation aims to investigate the effect size measures of differential item functioning (DIF) detection in the context of cognitive diagnostic models (CDMs). A variety of DIF detection techniques have been developed in the context of CDMs. However, most of the DIF detection procedures focus on the null hypothesis significance test. Few…
Descriptors: Effect Size, Item Response Theory, Cognitive Measurement, Models
Peer reviewed
Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019
Item response theory (IRT) has many advantages over its predecessor, Classical Test Theory (CTT), such as invariant item parameters and ability parameter estimates that do not depend on the items. However, to obtain these advantages, some assumptions must be met: unidimensionality, normality, and local independence. However, it is not…
Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models
Peer reviewed
Sideridis, Georgios; Tsaousis, Ioannis; Al Harbi, Khaleel – Educational and Psychological Measurement, 2017
The purpose of the present article was to illustrate, using an example from a national assessment, the value of analyzing the behavior of distractors in measures that use the multiple-choice format. A secondary purpose of the present article was to illustrate four remedial actions that can potentially improve the measurement of the…
Descriptors: Multiple Choice Tests, Attention Control, Testing, Remedial Instruction
Stapleton, Laura M.; Kang, Yoonjeong – Sociological Methods & Research, 2018
This research empirically evaluates data sets from the National Center for Education Statistics (NCES) for design effects of ignoring the sampling design in weighted two-level analyses. Currently, researchers may ignore the sampling design beyond the levels that they model which might result in incorrect inferences regarding hypotheses due to…
Descriptors: Probability, Hierarchical Linear Modeling, Sampling, Inferences
Peer reviewed
Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014
Typically, longitudinal growth modeling based on item response theory (IRT) requires repeated-measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses, and tests at multiple time points are constructed with unique…
Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores
Peer reviewed
Bouhlila, Donia Smaali; Sellaouti, Fethi – Large-scale Assessments in Education, 2013
In this paper, we document a study that involved applying a multiple imputation technique with chained equations to data drawn from the 2007 iteration of the TIMSS database. More precisely, we imputed missing variables contained in the student background datafile for Tunisia (one of the TIMSS 2007 participating countries), by using Van Buuren,…
Descriptors: Databases, Student Characteristics, Error of Measurement, Intervals
Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2012
In this report, the authors describe the value-added models used as part of teacher evaluation systems in the District of Columbia Public Schools (DCPS) and in eligible DC charter schools participating in Race to the Top. They estimated (1) teacher effectiveness in DCPS and eligible DC charter schools during the 2011-2012 school year; and (2)…
Descriptors: Value Added Models, Teacher Evaluation, Public Schools, Urban Schools
Green, Donald Ross; And Others – 1988
Potential benefits of using item response theory in test construction are evaluated, based on the experience and evidence accumulated during 9 years of using a three-parameter model in the construction of major achievement batteries. Specific benefits covered include obtaining sample-free item calibrations and item-free person measurement,…
Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Elementary Secondary Education
Cypress, Beulah K. – 1973
The potential of the Rasch model to develop scores, on a ratio scale, suitable for interindividual comparisons, from intact groups with disparate distribution characteristics was investigated. The specific problems studied were: (1) the effects of skewed test score distributions on the ability parameter of the Rasch measurement model; (2) the…
Descriptors: Achievement Tests, Data Analysis, Error of Measurement, Measurement Instruments
Hendrickson, Leslie; Jones, Barnie – 1982
The logic of using a gain score approach versus longitudinal causal models is studied in this secondary analysis of a complex data base. The gain score model used by the Federal Reserve Bank and the School District of Philadelphia in their "What Works in Reading?" study is successively refined using the LISREL structural equation…
Descriptors: Achievement Gains, Achievement Tests, Data Analysis, Elementary Education