ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	47
Since 2006 (last 20 years)	121

Descriptor

Evaluation Methods	228
Statistical Analysis	228
Hypothesis Testing	66
Testing	56
Foreign Countries	51
Student Evaluation	50
Computer Assisted Testing	46
Measurement Techniques	37
Research Methodology	35
Comparative Analysis	31
Correlation	30
Models	29
Program Evaluation	28
Educational Testing	27
Scores	26
Educational Research	24
Questionnaires	22
Test Reliability	22
Tests	21
Academic Achievement	20
Test Construction	20
Higher Education	18
Testing Problems	18
Program Effectiveness	17
Simulation	17
More ▼

Education Level

Higher Education	42
Postsecondary Education	28
Elementary Education	16
Secondary Education	14
Elementary Secondary Education	12
Middle Schools	12
Junior High Schools	7
Grade 6	6
High Schools	6
Intermediate Grades	6
Grade 5	5
Early Childhood Education	3
Grade 4	3
Grade 7	3
Grade 8	3
Adult Education	2
Grade 10	2
Grade 9	2
Grade 1	1
Grade 11	1
Grade 12	1
Grade 2	1
Grade 3	1
Kindergarten	1
Preschool Education	1
More ▼

Audience

Researchers	13
Practitioners	5
Administrators	2
Students	2
Policymakers	1
Teachers	1

Location

United Kingdom	6
Australia	5
Japan	4
Netherlands	4
New Zealand	4
Germany	3
South Africa	3
United Kingdom (England)	3
California	2
Canada	2
Denmark	2
Finland	2
Iran	2
Mexico	2
Nigeria	2
Pennsylvania	2
Russia	2
Saudi Arabia	2
Spain	2
Taiwan	2
Texas	2
United States	2
Arkansas	1
Austria	1
Belgium	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	3
Individuals with Disabilities…	1
Occupational Safety and…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 228 results Save | Export

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed
PDF on ERIC

Download full text

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

Informative Hypothesis for Group Means Comparison

Peer reviewed
PDF on ERIC

Download full text

Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023

Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…

Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Direct link

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

Game-Based Assessment: Investigating the Impact on Test Anxiety and Exam Performance

Peer reviewed

Direct link

Mavridis, A.; Tsiatsos, T. – Journal of Computer Assisted Learning, 2017

The aim of this study is to assess the impact of a 3D educational computer game on students' test anxiety and exam performance when used in evaluative situations as compared to the traditional method of examination. The participants of the study were students in tertiary education who were examined using game-based assessment and traditional…

Descriptors: Computer Games, Teaching Methods, Test Anxiety, Statistical Analysis

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed
PDF on ERIC

Download full text

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Estimating Statistical Power When Making Adjustments for Multiple Tests

Peer reviewed
PDF on ERIC

Download full text

Porter, Kristin E. – Society for Research on Educational Effectiveness, 2016

In recent years, there has been increasing focus on the issue of multiple hypotheses testing in education evaluation studies. In these studies, researchers are typically interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time or across multiple treatment groups. When…

Descriptors: Hypothesis Testing, Intervention, Error Patterns, Evaluation Methods

A Statistical Procedure for Testing Unusually Frequent Exactly Matching Responses and Nearly Matching Responses. Research Report. ETS RR-17-23

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017

In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…

Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics

Using the Standard Wald Confidence Interval for a Population Proportion Hypothesis Test Is a Common Mistake

Peer reviewed

Direct link

Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019

Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…

Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques

Effects of Quizzing Methodology on Student Outcomes: Reading Compliance, Retention, and Perceptions

Peer reviewed
PDF on ERIC

Download full text

Dowling, Carey Bernini – International Journal for the Scholarship of Teaching and Learning, 2017

This study set out to replicate and extend research on students' reading compliance and examine the impact of daily quizzing methodology on students' reading compliance and retention. 98 students in two sections of Abnormal Psychology participated (mean age = 21.5, SD = 3.35; 72.4% Caucasian). Using a multiple baseline quasi-experimental design…

Descriptors: Undergraduate Students, Psychopathology, Evaluation Methods, Testing

Statistical Power in Evaluations That Investigate Effects on Multiple Outcomes: A Guide for Researchers

Peer reviewed

Direct link

Porter, Kristin E. – Journal of Research on Educational Effectiveness, 2018

Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs) are statistical…

Descriptors: Statistical Analysis, Program Effectiveness, Intervention, Hypothesis Testing

An Oral Component in PhD Examination in Australia: Issues and Considerations

Peer reviewed
PDF on ERIC

Download full text

Kiley, Margaret; Holbrook, Allyson; Lovat, Terence; Fairbairn, Hedy; Starfield, Sue; Paltridge, Brian – Australian Universities' Review, 2018

While there has been considerable research on doctoral examination there is little that examines the various roles of the oral component and what issues one might consider if introducing or revising that aspect of the thesis examination process. This matter is of particular importance in Australia where it is not usual to have an oral component as…

Descriptors: Foreign Countries, Doctoral Dissertations, Evaluation Methods, Verbal Tests

Construction of Expert Knowledge Monitoring and Assessment System Based on Integral Method of Knowledge Evaluation

Peer reviewed
PDF on ERIC

Download full text

Golovachyova, Viktoriya N.; Menlibekova, Gulbakhyt Zh.; Abayeva, Nella F.; Ten, Tatyana L.; Kogaya, Galina D. – International Journal of Environmental and Science Education, 2016

Using computer-based monitoring systems that rely on tests could be the most effective way of knowledge evaluation. The problem of objective knowledge assessment by means of testing takes on a new dimension in the context of new paradigms in education. The analysis of the existing test methods enabled us to conclude that tests with selected…

Descriptors: Expertise, Computer Assisted Testing, Student Evaluation, Knowledge Level

Using Indirect vs. Direct Measures in the Summative Assessment of Student Learning in Higher Education

Peer reviewed
PDF on ERIC

Download full text

Luce, Christine; Kirnan, Jean P. – Journal of the Scholarship of Teaching and Learning, 2016

Contradictory results have been reported regarding the accuracy of various methods used to assess student learning in higher education. The current study examined student learning outcomes across a multi-section and mult-iinstructor psychology research course with both indirect and direct assessments in a sample of 67 undergraduate students. The…

Descriptors: Undergraduate Students, Psychology, Methods Courses, Student Evaluation

Bayesian Posterior Odds Ratios: Statistical Tools for Collaborative Evaluations

Peer reviewed
PDF on ERIC

Download full text

Direct link

Hicks, Tyler; Rodríguez-Campos, Liliana; Choi, Jeong Hoon – American Journal of Evaluation, 2018

To begin statistical analysis, Bayesians quantify their confidence in modeling hypotheses with priors. A prior describes the probability of a certain modeling hypothesis apart from the data. Bayesians should be able to defend their choice of prior to a skeptical audience. Collaboration between evaluators and stakeholders could make their choices…

Descriptors: Bayesian Statistics, Evaluation Methods, Statistical Analysis, Hypothesis Testing

Document Level Assessment of Document Retrieval Systems in a Pairwise System Evaluation

Peer reviewed
PDF on ERIC

Download full text

Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017

Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…

Descriptors: Information Retrieval, Documentation, Scores, Information Systems

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

Educational and Psychological…	8
ProQuest LLC	6
Assessment & Evaluation in…	4
ETS Research Report Series	3
International Education…	3
Structural Equation Modeling:…	3
American Psychologist	2
Applied Psychological…	2
Assessment in Education:…	2
European Journal of…	2
Grantee Submission	2
International Educational…	2
International Journal of…	2
International Journal of…	2
Journal of Research on…	2
Online Submission	2
Psychological Methods	2
Regional Educational…	2
Research Papers in Education	2
Turkish Online Journal of…	2
ACT, Inc.	1
AEDS J	1
Acta Didactica Napocensia	1
Active Learning in Higher…	1
Advances in Physiology…	1
More ▼

Porter, Kristin E.	4
Bobbett, Gordon	2
Booker, Kevin	2
Bruch, Julie	2
Burstein, Leigh	2
French, Russell L.	2
Gill, Brian	2
Millsap, Roger E.	2
Wilcox, Rand R.	2
Zimmerman, Donald W.	2
Zumbo, Bruno D.	2
ALTMANN, BERTHOLD	1
Abayeva, Nella F.	1
Abdel Latif, Muhammad M.	1
Abedor, Allan J.	1
Admiraal, Wilfried	1
Ajayi, Nurudeen A.	1
Ajuonuma, Juliet O.	1
Akelaitis, Arturas V.	1
Alexander, Melody W.	1
Algina, James	1
Alzaid, Jawaher Mohammed	1
Ames, Russell	1
Amorim, Paulo Roberto S.	1
More ▼

Journal Articles	126
Reports - Research	106
Reports - Evaluative	31
Speeches/Meeting Papers	23
Reports - Descriptive	21
Guides - Non-Classroom	13
Dissertations/Theses -…	6
Information Analyses	6
Books	4
Opinion Papers	4
Tests/Questionnaires	4
Collected Works - General	2
Collected Works - Proceedings	2
Collected Works - Serials	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - General	1
Numerical/Quantitative Data	1
Reference Materials -…	1
Reference Materials -…	1
Reports - General	1
More ▼

ACT Assessment	2
Dynamic Indicators of Basic…	2
Iowa Tests of Basic Skills	2
Preliminary Scholastic…	2
Stanford Achievement Tests	2
Test of English as a Foreign…	2
Adjective Check List	1
Autism Diagnostic Observation…	1
Defining Issues Test	1
Florida Comprehensive…	1
Georgia Criterion Referenced…	1
National Assessment of…	1
National Longitudinal…	1
Nelson Denny Reading Tests	1
Peabody Picture Vocabulary…	1
Program for International…	1
Raven Progressive Matrices	1
SAT (College Admission Test)	1
Social Skills Rating System	1
Test of English for…	1
Vineland Adaptive Behavior…	1
More ▼