Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 5 |
Descriptor
Source
Educational Measurement:… | 2 |
American Journal of Evaluation | 1 |
Applied Psychological… | 1 |
International Journal of… | 1 |
MDRC | 1 |
Practical Assessment,… | 1 |
Psychological Methods | 1 |
Psychometrika | 1 |
Social Indicators Research | 1 |
Author
Sijtsma, Klaas | 2 |
Balu, Rekha | 1 |
Chaaban, Jad M. | 1 |
Ciechalski, Joseph C. | 1 |
Emons, Wilco H. M. | 1 |
Esther S. Yao | 1 |
Hanes, John C. | 1 |
Hessen, David J. | 1 |
Junker, Brian | 1 |
Kane Meissel | 1 |
Lam, Tony C. M. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 12 |
Journal Articles | 9 |
Opinion Papers | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Kane Meissel; Esther S. Yao – Practical Assessment, Research & Evaluation, 2024
Effect sizes are important because they are an accessible way to indicate the practical importance of observed associations or differences. Standardized mean difference (SMD) effect sizes, such as Cohen's d, are widely used in education and the social sciences -- in part because they are relatively easy to calculate. However, SMD effect sizes…
Descriptors: Computer Software, Programming Languages, Effect Size, Correlation
Wind, Stefanie A. – Educational Measurement: Issues and Practice, 2017
Mokken scale analysis (MSA) is a probabilistic-nonparametric approach to item response theory (IRT) that can be used to evaluate fundamental measurement properties with less strict assumptions than parametric IRT models. This instructional module provides an introduction to MSA as a probabilistic-nonparametric framework in which to explore…
Descriptors: Probability, Nonparametric Statistics, Item Response Theory, Scaling
Porter, Kristin E.; Balu, Rekha – MDRC, 2016
Education systems are increasingly creating rich, longitudinal data sets with frequent, and even real-time, data updates of many student measures, including daily attendance, homework submissions, and exam scores. These data sets provide an opportunity for district and school staff members to move beyond an indicators-based approach and instead…
Descriptors: Models, Prediction, Statistical Analysis, Elementary Secondary Education
Chaaban, Jad M. – Social Indicators Research, 2009
This paper develops an empirical methodology for the construction of a synthetic multi-dimensional cross-country comparison of the performance of governments around the world in improving the livelihood of their younger population. The devised "Youth Welfare Index" is based on the nonparametric Data Envelopment Analysis (DEA) methodology and…
Descriptors: Nonparametric Statistics, Research Methodology, Evaluation Methods, Comparative Education
Ryden, Jesper – International Journal of Mathematical Education in Science and Technology, 2008
Extreme-value statistics is often used to estimate so-called return values (actually related to quantiles) for environmental quantities like wind speed or wave height. A basic method for estimation is the method of block maxima which consists in partitioning observations in blocks, where maxima from each block could be considered independent.…
Descriptors: Simulation, Probability, Computation, Nonparametric Statistics

Hanes, John C. – American Journal of Evaluation, 2002
Describes the number needed to treat (NNT) as an indicator of a program's efficacy. L'Abbe plots compare several programs in terms of the event rates that constitute the NNTs. Explains how an evaluator may extend the NNT concept by calculating and plotting event rates on a continuous ordinal basis. (SLD)
Descriptors: Evaluation Methods, Nonparametric Statistics, Program Evaluation

Junker, Brian; Sijtsma, Klaas – Applied Psychological Measurement, 2001
Discusses usability and interpretation issues for single-strategy cognitive assessment models that posit a stochastic, conjunctive relationship between a set of cognitive attributes to be assessed and performance on particular items/tasks of the assessment. Also discusses stochastic ordering and monotonicity properties that enhance the…
Descriptors: Cognitive Processes, Evaluation Methods, Item Response Theory, Models

Penfield, Randall D.; Lam, Tony C. M. – Educational Measurement: Issues and Practice, 2000
Discusses extending research into differential item functioning (DIF) to performance assessment and considers some of the best options available at this time. The most effective strategy for assessing DIF is to use a combination of several methods, preferably one from each of the observed score nonparametric, latent score nonparametric, and…
Descriptors: Evaluation Methods, Item Bias, Nonparametric Statistics, Performance Based Assessment
Ciechalski, Joseph C.; Pinkney, James W.; Weaver, Florence S. – 2002
This paper illustrates the use of the McNemar Test, using a hypothetical problem. The McNemar Test is a nonparametric statistical test that is a type of chi square test using dependent, rather than independent, samples to assess before-after designs in which each subject is used as his or her own control. Results of the McNemar test make it…
Descriptors: Attitude Change, Chi Square, Evaluation Methods, Nonparametric Statistics
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2005
Person-fit statistics test whether the likelihood of a respondent's complete vector of item scores on a test is low given the hypothesized item response theory model. This binary information may be insufficient for diagnosing the cause of a misfitting item-score vector. The authors propose a comprehensive methodology for person-fit analysis in the…
Descriptors: Evaluation Methods, Item Response Theory, Evaluation Research, Goodness of Fit
Hessen, David J. – Psychometrika, 2005
In the present paper, a new family of item response theory (IRT) models for dichotomous item scores is proposed. Two basic assumptions define the most general model of this family. The first assumption is local independence of the item scores given a unidimensional latent trait. The second assumption is that the odds-ratios for all item-pairs are…
Descriptors: Item Response Theory, Scores, Test Items, Models
Lay, Robert S. – 1983
The advantages and disadvantages of new software for market segmentation analysis are discussed, and the application of this new, chi-square based procedure (CHAID), is illustrated. A comparison is presented of an earlier, binary segmentation technique (THAID) and a multiple discriminant analysis. It is suggested that CHAID is superior to earlier…
Descriptors: College Applicants, Computer Programs, Evaluation Methods, Flow Charts