NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Chan, Wendy – American Journal of Evaluation, 2022
Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…
Descriptors: Probability, Scores, Scoring, Generalization
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Li, Zhen; Cai, Li – Grantee Submission, 2017
In standard item response theory (IRT) applications, the latent variable is typically assumed to be normally distributed. If the normality assumption is violated, the item parameter estimates can become biased. Summed score likelihood based statistics may be useful for testing latent variable distribution fit. We develop Satorra-Bentler type…
Descriptors: Scores, Goodness of Fit, Statistical Distributions, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Bramley, Tom – Research in Mathematics Education, 2017
This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…
Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation
Feinberg, Richard A. – ProQuest LLC, 2012
Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…
Descriptors: Simulation, Tests, Testing, Scores
Corcoran, Sean P.; Baker-Smith, Christine – Research Alliance for New York City Schools, 2015
New York City's elite public specialized high schools have a long history of offering a rigorous college preparatory education to the City's most academically talented students. Though immensely popular and highly selective, their policy of admitting students on the basis of a single entrance exam has been heavily criticized. Many argue, for…
Descriptors: High Schools, Urban Schools, Special Schools, Gifted
Peer reviewed Peer reviewed
Zimmerman, Donald W.; Williams, Richard H. – Applied Psychological Measurement, 2000
Restricted the range of nonnormal distributions by eliminating scores above a designated cutoff value or eliminating scores above or below the mean by a certain distance. Results of a simulation study show that range restriction sometimes increased the correlation between variables having outlier prone distributions. Discusses practical…
Descriptors: Correlation, Scores, Simulation, Statistical Distributions
Peer reviewed Peer reviewed
You, Soon-Hyung; Stone-Romero, Eugene F. – Educational and Psychological Measurement, 1996
To clarify the findings of R. Gillett (1991) about the inequality of the means of test scores of minority and majority examinees, the standard errors of the quota-selected sample means and the sampling distribution of these means were studied through Monte Carlo simulation. Results explain that the quota selection inequality results from…
Descriptors: Error of Measurement, Minority Groups, Monte Carlo Methods, Sampling
Meijer, Rob R.; van Krimpen-Stoop, Edith M. L. A. – 1998
Several person-fit statistics have been proposed to detect item score patterns that do not fit an item response theory model. To classify response patterns as not fitting a model, a distribution of a person-fit statistic is needed. The null distributions of several fit statistics have been investigated using conventionally administered tests, but…
Descriptors: Ability, Adaptive Testing, Foreign Countries, Item Response Theory
Pommerich, Mary; And Others – 1994
The functioning of two population-based Mantel-Haenszel (MH) common-odds ratios was compared. One ratio is conditioned on the observed test score, while the other is conditioned on a latent trait or true ability score. When the comparison group distributions are incongruent or nonoverlapping to some degree, the observed score represents different…
Descriptors: Ability, Comparative Analysis, Item Bias, Performance
Hambleton, Ronald K. – 1995
Performance assessments in education and credentialing are becoming popular. At the same time, there do not exist any well established and validated methods for setting standards on performance assessments. This paper describes several of the new standard-setting methods that are emerging for use with performance assessments and considers their…
Descriptors: Achievement Tests, Cutting Scores, Holistic Evaluation, Licensing Examinations (Professions)
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests