ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Scores	11
Simulation	11
Statistical Distributions	11
Item Response Theory	3
Test Items	3
Ability	2
Comparative Analysis	2
Correlation	2
Foreign Countries	2
Licensing Examinations…	2
Probability	2
Reliability	2
Sample Size	2
Sampling	2
Statistical Studies	2
Test Construction	2
Testing Problems	2
Academic Achievement	1
Academic Persistence	1
Academic Standards	1
Accuracy	1
Achievement Tests	1
Adaptive Testing	1
Admission Criteria	1
Classification	1
More ▼

Source

American Journal of Evaluation	1
Applied Psychological…	1
Educational and Psychological…	1
Grantee Submission	1
ProQuest LLC	1
Research Alliance for New…	1
Research in Mathematics…	1

Author

Baker-Smith, Christine	1
Bramley, Tom	1
Cai, Li	1
Chan, Wendy	1
Corcoran, Sean P.	1
Feinberg, Richard A.	1
Hambleton, Ronald K.	1
Li, Zhen	1
Meijer, Rob R.	1
Pommerich, Mary	1
Sarvela, Paul D.	1
Stone-Romero, Eugene F.	1
Williams, Richard H.	1
You, Soon-Hyung	1
Zimmerman, Donald W.	1
van Krimpen-Stoop, Edith M.…	1
More ▼

Publication Type

Reports - Evaluative	5
Journal Articles	4
Reports - Research	4
Speeches/Meeting Papers	3
Dissertations/Theses -…	1
Information Analyses	1
Reports -…	1

Education Level

Secondary Education	2
Elementary Education	1
Grade 8	1
High Schools	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers

Location

New York	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Direct link

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

Summed Score Likelihood Based Indices for Testing Latent Variable Distribution Fit in Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Li, Zhen; Cai, Li – Grantee Submission, 2017

In standard item response theory (IRT) applications, the latent variable is typically assumed to be normally distributed. If the normality assumption is violated, the item parameter estimates can become biased. Summed score likelihood based statistics may be useful for testing latent variable distribution fit. We develop Satorra-Bentler type…

Descriptors: Scores, Goodness of Fit, Statistical Distributions, Item Response Theory

Some Implications of Choice of Tiering Model in GCSE Mathematics for Inferences about What Students Know and Can Do

Peer reviewed

Direct link

Bramley, Tom – Research in Mathematics Education, 2017

This study compared models of assessment structure for achieving differentiation across the range of examinee attainment in the General Certificate of Secondary Education (GCSE) examination taken by 16-year-olds in England. The focus was on the "adjacent levels" model, where papers are targeted at three specific non-overlapping ranges of…

Descriptors: Foreign Countries, Mathematics Education, Student Certification, Student Evaluation

A Simulation Study of the Situations in Which Reporting Subscores Can Add Value to Licensure Examinations

Direct link

Feinberg, Richard A. – ProQuest LLC, 2012

Subscores, also known as domain scores, diagnostic scores, or trait scores, can help determine test-takers' relative strengths and weaknesses and appropriately focus remediation. However, subscores often have poor psychometric properties, particularly reliability and distinctiveness (Folske, Gessaroli, & Swanson, 1999; Monaghan, 2006;…

Descriptors: Simulation, Tests, Testing, Scores

Pathways to an Elite Education: Application, Admission, and Matriculation to New York City's Specialized High Schools. Working Paper

Direct link

Corcoran, Sean P.; Baker-Smith, Christine – Research Alliance for New York City Schools, 2015

New York City's elite public specialized high schools have a long history of offering a rigorous college preparatory education to the City's most academically talented students. Though immensely popular and highly selective, their policy of admitting students on the basis of a single entrance exam has been heavily criticized. Many argue, for…

Descriptors: High Schools, Urban Schools, Special Schools, Gifted

Restriction of Range and Correlation in Outlier-Prone Distributions.

Peer reviewed

Zimmerman, Donald W.; Williams, Richard H. – Applied Psychological Measurement, 2000

Restricted the range of nonnormal distributions by eliminating scores above a designated cutoff value or eliminating scores above or below the mean by a certain distance. Results of a simulation study show that range restriction sometimes increased the correlation between variables having outlier prone distributions. Discusses practical…

Descriptors: Correlation, Scores, Simulation, Statistical Distributions

Determinants of the Quota Selection Inequality Phenomenon: Clarification of the Basis for Gillett's (1991) Findings.

Peer reviewed

You, Soon-Hyung; Stone-Romero, Eugene F. – Educational and Psychological Measurement, 1996

To clarify the findings of R. Gillett (1991) about the inequality of the means of test scores of minority and majority examinees, the standard errors of the quota-selected sample means and the sampling distribution of these means were studied through Monte Carlo simulation. Results explain that the quota selection inequality results from…

Descriptors: Error of Measurement, Minority Groups, Monte Carlo Methods, Sampling

Simulating the Null Distribution of Person-Fit Statistics for Conventional and Adaptive Tests. Research Report 98-02.

Download full text

Meijer, Rob R.; van Krimpen-Stoop, Edith M. L. A. – 1998

Several person-fit statistics have been proposed to detect item score patterns that do not fit an item response theory model. To classify response patterns as not fitting a model, a distribution of a person-fit statistic is needed. The null distributions of several fit statistics have been investigated using conventionally administered tests, but…

Descriptors: Ability, Adaptive Testing, Foreign Countries, Item Response Theory

The Performance of the Mantel-Haenszel DIF Statistic When Comparison Group Distributions Are Incongruent.

Download full text

Pommerich, Mary; And Others – 1994

The functioning of two population-based Mantel-Haenszel (MH) common-odds ratios was compared. One ratio is conditioned on the observed test score, while the other is conditioned on a latent trait or true ability score. When the comparison group distributions are incongruent or nonoverlapping to some degree, the observed score represents different…

Descriptors: Ability, Comparative Analysis, Item Bias, Performance

Setting Standards on Performance Assessments: Promising New Methods and Technical Issues.

Download full text

Hambleton, Ronald K. – 1995

Performance assessments in education and credentialing are becoming popular. At the same time, there do not exist any well established and validated methods for setting standards on performance assessments. This paper describes several of the new standard-setting methods that are emerging for use with performance assessments and considers their…

Descriptors: Achievement Tests, Cutting Scores, Holistic Evaluation, Licensing Examinations (Professions)

Discrimination Indices Commonly Used in Military Training Environments: Effects of Departures from Normal Distributions.

Download full text

Sarvela, Paul D. – 1986

Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests