Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 19 |
Descriptor
Evaluation Methods | 28 |
Generalization | 26 |
Models | 7 |
Student Evaluation | 6 |
Test Validity | 5 |
Validity | 5 |
Educational Assessment | 4 |
Inferences | 4 |
Measurement Techniques | 4 |
Bayesian Statistics | 3 |
Comparative Analysis | 3 |
More ▼ |
Source
Author
Publication Type
Reports - Evaluative | 28 |
Journal Articles | 23 |
Speeches/Meeting Papers | 2 |
Information Analyses | 1 |
Education Level
Elementary Education | 3 |
Elementary Secondary Education | 2 |
Early Childhood Education | 1 |
Grade 2 | 1 |
Grade 6 | 1 |
Higher Education | 1 |
Primary Education | 1 |
Audience
Researchers | 2 |
Policymakers | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Child Behavior Checklist | 1 |
What Works Clearinghouse Rating
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023
Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…
Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines
Jaciw, Andrew P.; Unlu, Fatih; Nguyen, Thanh – American Journal of Evaluation, 2022
There is a burgeoning body of evidence on the average impacts of educational programs. Yet, for many local decision makers, because impacts can vary across sites, the question of whether a certain program will work in their particular district or school remains. This article addresses the question of the generalizability of large-scale average…
Descriptors: Program Effectiveness, Generalization, Outcome Measures, Institutional Characteristics
Lay, Alexandra; Patton, Elizabeth; Chalhoub-Deville, Micheline – Language Testing in Asia, 2017
Dynamic assessments in general, and game-based assessment (GBA) specifically, compel us to rethink prevailing language testing conceptualizations of context. Context has traditionally been portrayed with a cognitive orientation, which focuses on static abilities, ignores complex interactions, devalues the role of tasks in determining scores, and…
Descriptors: Alternative Assessment, Game Based Learning, Evaluation Methods, Language Tests
Shabani, Karim – Cogent Education, 2016
Dynamic assessment (DA) research, still in its infancy, takes its roots from Vygotsky's concept of zone of proximal development (ZPD) to account for learner's developmental process. Breaking away from a static, incomplete and, thus, unethical assessment of learner's abilities, DA came to the fore to better crystallize learner's levels of abilities…
Descriptors: Sociocultural Patterns, Psychometrics, Second Language Learning, Ethics
Gongjun Xu; Tony Sit; Lan Wang; Chiung-Yu Huang – Grantee Submission, 2017
Biased sampling occurs frequently in economics, epidemiology, and medical studies either by design or due to data collecting mechanism. Failing to take into account the sampling bias usually leads to incorrect inference. We propose a unified estimation procedure and a computationally fast resampling method to make statistical inference for…
Descriptors: Sampling, Statistical Inference, Computation, Generalization
Kol, Sheli; Nir, Bracha; Wintner, Shuly – Journal of Child Language, 2014
Several models of language acquisition have emerged in recent years that rely on computational algorithms for simulation and evaluation. Computational models are formal and precise, and can thus provide mathematically well-motivated insights into the process of language acquisition. Such models are amenable to robust computational evaluation,…
Descriptors: Language Acquisition, Models, Computational Linguistics, Evaluation Methods
Pinto, Carlos; Machado, Armando – Learning and Motivation, 2011
To better understand short-term memory for temporal intervals, we re-examined the choose-short effect. In Experiment 1, to contrast the predictions of two models of this effect, the subjective shortening and the coding models, pigeons were exposed to a delayed matching-to-sample task with three sample durations (2, 6 and 18 s) and retention…
Descriptors: Intervals, Infants, Tests, Short Term Memory

Kaniel, Shlomo – Gifted Education International, 2010
The article responds to the need for evidence-based dynamic assessment. The article is divided into two sections: In Part 1 we examine the scientific answer to the question of how far human mental activities and capabilities are domain general (DG) / domain specific (DS). A highly complex answer emerges from the literature review of domains such…
Descriptors: Cognitive Processes, Cognitive Ability, Intelligence, Personality Traits
Jenkins, Melissa M.; Youngstrom, Eric A.; Youngstrom, Jennifer Kogos; Feeny, Norah C.; Findling, Robert L. – Psychological Assessment, 2012
Bipolar disorder is frequently clinically diagnosed in youths who do not actually satisfy Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; DSM-IV-TR; American Psychiatric Association, 1994) criteria, yet cases that would satisfy full DSM-IV-TR criteria are often undetected clinically. Evidence-based assessment methods…
Descriptors: Evidence, Mental Health, Mental Disorders, Clinical Diagnosis
Derenne, Adam; Loshek, Eevett – Behavior Analyst Today, 2009
This paper describes materials designed for classroom projects on stimulus generalization and peak shift. A computer program (originally written in QuickBASIC) is used for data collection and a Microsoft Excel file with macros organizes the raw data on a spreadsheet and creates generalization gradients. The program is designed for use with human…
Descriptors: Computer Software, Stimulus Generalization, Data Collection, Evaluation Methods
Shalem, Yael; Slonimsky, Lynne – British Journal of Sociology of Education, 2010
This paper focuses on formative assessment in the field of higher education. It examines Bernstein's work on vertical discourses and knowledge structures with the view to deepening understanding of the concept of assessment "for" learning. The first part of the paper draws on Vygotsky's work on concept development and Bernstein's work on…
Descriptors: Student Evaluation, Semantics, Formative Evaluation, Evaluation Criteria
Mason, Corinne; Allam, Reynald; Brannick, Michael T. – Educational and Psychological Measurement, 2007
Reliability generalization studies have provided estimates of the mean reliability coefficients and examined factors that explain the variability in the reliability estimates across studies for many different tests and measures. Different authors have used different data analyses to do such meta-analyses, and little research has addressed whether…
Descriptors: Reliability, Monte Carlo Methods, Meta Analysis, Generalization
Ebner-Priemer, Ulrich W.; Trull, Timothy J. – Psychological Assessment, 2009
In this review, we discuss ecological momentary assessment (EMA) studies on mood disorders and mood dysregulation, illustrating 6 major benefits of the EMA approach to clinical assessment: (a) Real-time assessments increase accuracy and minimize retrospective bias; (b) repeated assessments can reveal dynamic processes; (c) multimodal assessments…
Descriptors: Feedback (Response), Clinical Diagnosis, Psychological Patterns, Context Effect
Shiffrin, Richard M.; Lee, Michael D.; Kim, Woojae; Wagenmakers, Eric-Jan – Cognitive Science, 2008
This article reviews current methods for evaluating models in the cognitive sciences, including theoretically based approaches, such as Bayes factors and minimum description length measures; simulation approaches, including model mimicry evaluations; and practical approaches, such as validation and generalization measures. This article argues…
Descriptors: Bayesian Statistics, Generalization, Sciences, Models
Heritage, Margaret; Kim, Jinok; Vendlinski, Terry P.; Herman, Joan L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2008
Based on the results of a generalizability study (G study) of measures of teacher knowledge for teaching mathematics developed at The National Center for Research, on Evaluation, Standards, and Student Testing (CRESST) at the University of California, Los Angeles, this report provides evidence that teachers are better at drawing reasonable…
Descriptors: Generalization, Formative Evaluation, Inferences, Mathematics Instruction
Previous Page | Next Page ยป
Pages: 1 | 2