ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	19

Descriptor

Evaluation Methods	28
Generalization	26
Models	7
Student Evaluation	6
Test Validity	5
Validity	5
Educational Assessment	4
Inferences	4
Measurement Techniques	4
Bayesian Statistics	3
Comparative Analysis	3
Foreign Countries	3
Formative Evaluation	3
Guidelines	3
Mathematics Instruction	3
Pedagogical Content Knowledge	3
Reliability	3
Research Methodology	3
Scores	3
Secondary Education	3
Simulation	3
Alternative Assessment	2
Clinical Diagnosis	2
Computation	2
Context Effect	2
More ▼

Publication Type

Reports - Evaluative	28
Journal Articles	23
Speeches/Meeting Papers	2
Information Analyses	1

Education Level

Elementary Education	3
Elementary Secondary Education	2
Early Childhood Education	1
Grade 2	1
Grade 6	1
Higher Education	1
Primary Education	1

Audience

Researchers	2
Policymakers	1

Location

California	2
Canada	1
Tennessee	1
United Kingdom	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

Child Behavior Checklist

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

Informative Hypothesis for Group Means Comparison

Peer reviewed
PDF on ERIC

Download full text

Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023

Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…

Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines

A Within-Study Approach to Evaluating the Role of Moderators of Impact in Limiting Generalizations from "Large to Small"

Peer reviewed

Direct link

Jaciw, Andrew P.; Unlu, Fatih; Nguyen, Thanh – American Journal of Evaluation, 2022

There is a burgeoning body of evidence on the average impacts of educational programs. Yet, for many local decision makers, because impacts can vary across sites, the question of whether a certain program will work in their particular district or school remains. This article addresses the question of the generalizability of large-scale average…

Descriptors: Program Effectiveness, Generalization, Outcome Measures, Institutional Characteristics

A Case for the Use of the Ability-In Language User-In Context Orientation in Game-Based Assessment

Peer reviewed

Direct link

Lay, Alexandra; Patton, Elizabeth; Chalhoub-Deville, Micheline – Language Testing in Asia, 2017

Dynamic assessments in general, and game-based assessment (GBA) specifically, compel us to rethink prevailing language testing conceptualizations of context. Context has traditionally been portrayed with a cognitive orientation, which focuses on static abilities, ignores complex interactions, devalues the role of tasks in determining scores, and…

Descriptors: Alternative Assessment, Game Based Learning, Evaluation Methods, Language Tests

Implications of Vygotsky's Sociocultural Theory for Second Language (L2) Assessment

Peer reviewed

Direct link

Shabani, Karim – Cogent Education, 2016

Dynamic assessment (DA) research, still in its infancy, takes its roots from Vygotsky's concept of zone of proximal development (ZPD) to account for learner's developmental process. Breaking away from a static, incomplete and, thus, unethical assessment of learner's abilities, DA came to the fore to better crystallize learner's levels of abilities…

Descriptors: Sociocultural Patterns, Psychometrics, Second Language Learning, Ethics

Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling

Peer reviewed
PDF on ERIC

Download full text

Direct link

Gongjun Xu; Tony Sit; Lan Wang; Chiung-Yu Huang – Grantee Submission, 2017

Biased sampling occurs frequently in economics, epidemiology, and medical studies either by design or due to data collecting mechanism. Failing to take into account the sampling bias usually leads to incorrect inference. We propose a unified estimation procedure and a computationally fast resampling method to make statistical inference for…

Descriptors: Sampling, Statistical Inference, Computation, Generalization

Computational Evaluation of the Traceback Method

Peer reviewed

Direct link

Kol, Sheli; Nir, Bracha; Wintner, Shuly – Journal of Child Language, 2014

Several models of language acquisition have emerged in recent years that rely on computational algorithms for simulation and evaluation. Computational models are formal and precise, and can thus provide mathematically well-motivated insights into the process of language acquisition. Such models are amenable to robust computational evaluation,…

Descriptors: Language Acquisition, Models, Computational Linguistics, Evaluation Methods

Short-Term Memory for Temporal Intervals: Contrasting Explanations of the Choose-Short Effect in Pigeons

Peer reviewed

Direct link

Pinto, Carlos; Machado, Armando – Learning and Motivation, 2011

To better understand short-term memory for temporal intervals, we re-examined the choose-short effect. In Experiment 1, to contrast the predictions of two models of this effect, the subjective shortening and the coding models, pigeons were exposed to a delayed matching-to-sample task with three sample durations (2, 6 and 18 s) and retention…

Descriptors: Intervals, Infants, Tests, Short Term Memory

Domain Specific vs Domain General: Implications for Dynamic Assessment

Peer reviewed

Kaniel, Shlomo – Gifted Education International, 2010

The article responds to the need for evidence-based dynamic assessment. The article is divided into two sections: In Part 1 we examine the scientific answer to the question of how far human mental activities and capabilities are domain general (DG) / domain specific (DS). A highly complex answer emerges from the literature review of domains such…

Descriptors: Cognitive Processes, Cognitive Ability, Intelligence, Personality Traits

Generalizability of Evidence-Based Assessment Recommendations for Pediatric Bipolar Disorder

Peer reviewed

Direct link

Jenkins, Melissa M.; Youngstrom, Eric A.; Youngstrom, Jennifer Kogos; Feeny, Norah C.; Findling, Robert L. – Psychological Assessment, 2012

Bipolar disorder is frequently clinically diagnosed in youths who do not actually satisfy Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; DSM-IV-TR; American Psychiatric Association, 1994) criteria, yet cases that would satisfy full DSM-IV-TR criteria are often undetected clinically. Evidence-based assessment methods…

Descriptors: Evidence, Mental Health, Mental Disorders, Clinical Diagnosis

A Computer-Based Laboratory Project for the Study of Stimulus Generalization and Peak Shift

Peer reviewed
PDF on ERIC

Download full text

Derenne, Adam; Loshek, Eevett – Behavior Analyst Today, 2009

This paper describes materials designed for classroom projects on stimulus generalization and peak shift. A computer program (originally written in QuickBASIC) is used for data collection and a Microsoft Excel file with macros organizes the raw data on a spreadsheet and creates generalization gradients. The program is designed for use with human…

Descriptors: Computer Software, Stimulus Generalization, Data Collection, Evaluation Methods

Seeing Epistemic Order: Construction and Transmission of Evaluative Criteria

Peer reviewed

Direct link

Shalem, Yael; Slonimsky, Lynne – British Journal of Sociology of Education, 2010

This paper focuses on formative assessment in the field of higher education. It examines Bernstein's work on vertical discourses and knowledge structures with the view to deepening understanding of the concept of assessment "for" learning. The first part of the paper draws on Vygotsky's work on concept development and Bernstein's work on…

Descriptors: Student Evaluation, Semantics, Formative Evaluation, Evaluation Criteria

How to Meta-Analyze Coefficient-of-Stability Estimates: Some Recommendations Based on Monte Carlo Studies

Peer reviewed

Direct link

Mason, Corinne; Allam, Reynald; Brannick, Michael T. – Educational and Psychological Measurement, 2007

Reliability generalization studies have provided estimates of the mean reliability coefficients and examined factors that explain the variability in the reliability estimates across studies for many different tests and measures. Different authors have used different data analyses to do such meta-analyses, and little research has addressed whether…

Descriptors: Reliability, Monte Carlo Methods, Meta Analysis, Generalization

Ecological Momentary Assessment of Mood Disorders and Mood Dysregulation

Peer reviewed

Direct link

Ebner-Priemer, Ulrich W.; Trull, Timothy J. – Psychological Assessment, 2009

In this review, we discuss ecological momentary assessment (EMA) studies on mood disorders and mood dysregulation, illustrating 6 major benefits of the EMA approach to clinical assessment: (a) Real-time assessments increase accuracy and minimize retrospective bias; (b) repeated assessments can reveal dynamic processes; (c) multimodal assessments…

Descriptors: Feedback (Response), Clinical Diagnosis, Psychological Patterns, Context Effect

A Survey of Model Evaluation Approaches with a Tutorial on Hierarchical Bayesian Methods

Peer reviewed

Direct link

Shiffrin, Richard M.; Lee, Michael D.; Kim, Woojae; Wagenmakers, Eric-Jan – Cognitive Science, 2008

This article reviews current methods for evaluating models in the cognitive sciences, including theoretically based approaches, such as Bayes factors and minimum description length measures; simulation approaches, including model mimicry evaluations; and practical approaches, such as validation and generalization measures. This article argues…

Descriptors: Bayesian Statistics, Generalization, Sciences, Models

From Evidence to Action: A Seamless Process in Formative Assessment? CRESST Report 741

Download full text

Heritage, Margaret; Kim, Jinok; Vendlinski, Terry P.; Herman, Joan L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2008

Based on the results of a generalizability study (G study) of measures of teacher knowledge for teaching mathematics developed at The National Center for Research, on Evaluation, Standards, and Student Testing (CRESST) at the University of California, Los Angeles, this report provides evidence that teachers are better at drawing reasonable…

Descriptors: Generalization, Formative Evaluation, Inferences, Mathematics Instruction

Previous Page | Next Page »

Pages: 1 | 2

Measurement:…	2
Psychological Assessment	2
American Journal of Evaluation	1
Behavior Analyst Today	1
British Journal of Sociology…	1
Career Development for…	1
Cogent Education	1
Cognitive Science	1
Educational Researcher	1
Educational and Psychological…	1
Evaluation and Program…	1
Evaluation and the Health…	1
Exceptional Children	1
Gifted Education International	1
Grantee Submission	1
Journal of Applied Behavior…	1
Journal of Child Language	1
Journal of Consulting and…	1
Language Testing in Asia	1
Learning and Motivation	1
National Center for Research…	1
Practical Assessment,…	1
Research Quarterly for…	1
More ▼

Blunk, Merrie	2
Hill, Heather C.	2
Achenbach, Thomas M.	1
Allam, Reynald	1
Almqvist, Fredrik	1
Ball, Deborah Loewenberg	1
Bilenberg, Niels	1
Bird, Hector	1
Bonfiglio, Christine M.	1
Brannick, Michael T.	1
Broberg, Anders G.	1
Chalhoub-Deville, Micheline	1
Chiung-Yu Huang	1
Collins, Belva C.	1
Daly, Edward J., III	1
Derenne, Adam	1
Dobrean, Anca	1
Dopfner, Manfred	1
Dumenci, Levent	1
Eastmond, Nick	1
Ebner-Priemer, Ulrich W.	1
Erol, Nese	1
Feeny, Norah C.	1
Findling, Robert L.	1
More ▼