Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 11 |
Descriptor
Comparative Analysis | 13 |
Evaluation Methods | 13 |
Generalization | 13 |
Foreign Countries | 4 |
Models | 4 |
Statistical Analysis | 4 |
Prediction | 3 |
Regression (Statistics) | 3 |
Scores | 3 |
Simulation | 3 |
Academic Achievement | 2 |
More ▼ |
Source
Author
Wagenmakers, Eric-Jan | 2 |
Ahn, Woo-Young | 1 |
Allam, Reynald | 1 |
Ambridge, Ben | 1 |
Brannick, Michael T. | 1 |
Busemeyer, Jerome R. | 1 |
Byram, Harold M. | 1 |
Chan, Wendy | 1 |
Chorna, Olga | 1 |
Geist, Pamela K. | 1 |
Hallberg, Kelly | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Reports - Research | 8 |
Reports - Evaluative | 3 |
Collected Works - Proceedings | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Elementary Education | 2 |
Intermediate Grades | 2 |
Middle Schools | 2 |
Elementary Secondary Education | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Junior High Schools | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Indiana Statewide Testing for… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023
Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…
Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines
Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Khamboonruang, Apichat – rEFLections, 2022
Although much research has compared the functioning between analytic and holistic rating scales, little research has compared the functioning of binary rating scales with other types of rating scales. This quantitative study set out to preliminarily and comparatively validate binary and analytic rating scales intended for use in formative…
Descriptors: Writing Evaluation, Evaluation Methods, Second Language Learning, Second Language Instruction
Chorna, Olga – Comparative Professional Pedagogy, 2015
The article reveals specific features of functioning systems of higher education quality monitoring at the present stage, taking into account national traditions, historical experience and mentality of the population. The article introduces a comparative analysis of monitoring actors at national, regional and local levels in two countries. The…
Descriptors: Educational Quality, Reputation, Comparative Analysis, Universities
Tipton, Elizabeth; Hallberg, Kelly; Hedges, Larry V.; Chan, Wendy – Society for Research on Educational Effectiveness, 2015
Policy-makers are frequently interested in understanding how effective a particular intervention may be for a specific (and often broad) population. In many fields, particularly education and social welfare, the ideal form of these evaluations is a large-scale randomized experiment. Recent research has highlighted that sites in these large-scale…
Descriptors: Generalization, Program Effectiveness, Sample Size, Computation
Ambridge, Ben – Cognitive Science, 2013
A paradox at the heart of language acquisition research is that, to achieve adult-like competence, children must acquire the ability to generalize verbs into non-attested structures, while avoiding utterances that are deemed ungrammatical by native speakers. For example, children must learn that, to denote the reversal of an action,…
Descriptors: Generalization, Comparative Analysis, Verbs, Grammar
Ahn, Woo-Young; Busemeyer, Jerome R.; Wagenmakers, Eric-Jan; Stout, Julie C. – Cognitive Science, 2008
It is a hallmark of a good model to make accurate "a priori" predictions to new conditions (Busemeyer & Wang, 2000). This study compared 8 decision learning models with respect to their generalizability. Participants performed 2 tasks (the Iowa Gambling Task and the Soochow Gambling Task), and each model made a priori predictions by estimating the…
Descriptors: Prediction, Generalization, Models, Comparative Analysis
Mason, Corinne; Allam, Reynald; Brannick, Michael T. – Educational and Psychological Measurement, 2007
Reliability generalization studies have provided estimates of the mean reliability coefficients and examined factors that explain the variability in the reliability estimates across studies for many different tests and measures. Different authors have used different data analyses to do such meta-analyses, and little research has addressed whether…
Descriptors: Reliability, Monte Carlo Methods, Meta Analysis, Generalization
Shiffrin, Richard M.; Lee, Michael D.; Kim, Woojae; Wagenmakers, Eric-Jan – Cognitive Science, 2008
This article reviews current methods for evaluating models in the cognitive sciences, including theoretically based approaches, such as Bayes factors and minimum description length measures; simulation approaches, including model mimicry evaluations; and practical approaches, such as validation and generalization measures. This article argues…
Descriptors: Bayesian Statistics, Generalization, Sciences, Models
Moss, Pamela A.; Sutherland, LeeAnn M.; Haniford, Laura; Miller, Renee; Johnson, David; Geist, Pamela K.; Koziol, Stephen M., Jr.; Star, Jon R.; Pecheone, Raymond L. – Education Policy Analysis Archives, 2004
This qualitative study is intended to illuminate factors that affect the generalizability of portfolio assessments of beginning teachers. By generalizability, we refer here to the extent to which the portfolio assessment supports generalizations from the particular evidence reflected in the portfolio to the conception of competent teaching…
Descriptors: Portfolios (Background Materials), Portfolio Assessment, Generalization, Beginning Teachers
Byram, Harold M. – 1971
In a proposed five-state demonstration, the four states of Arkansas, Minnesota, Mississippi and Nevada implemented a local system of directing evaluations of vocational/technical education programs in public schools. The emphasis was on both the training of local leaders by state leaders and direction of evaluation programs by local leaders. Using…
Descriptors: Comparative Analysis, Demonstration Programs, Evaluation Methods, Generalization
Stamper, John, Ed.; Pardos, Zachary, Ed.; Mavrikis, Manolis, Ed.; McLaren, Bruce M., Ed. – International Educational Data Mining Society, 2014
The 7th International Conference on Education Data Mining held on July 4th-7th, 2014, at the Institute of Education, London, UK is the leading international forum for high-quality research that mines large data sets in order to answer educational research questions that shed light on the learning process. These data sets may come from the traces…
Descriptors: Information Retrieval, Data Processing, Data Analysis, Data Collection