Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
Perez, Alexandra Lane; Evans, Carla – Applied Measurement in Education, 2023
New Hampshire's Performance Assessment of Competency Education (PACE) innovative assessment system uses student scores from classroom performance assessments as well as other classroom tests for school accountability purposes. One concern is that not having annual state testing may incentivize schools and teachers away from teaching the breadth of…
Descriptors: Grade 8, Competency Based Education, Evaluation Methods, Educational Innovation
Meyer, Robert; Diel, Tracy; Jahiu, Rinor; Tymeson, Hayley – Society for Research on Educational Effectiveness, 2023
Background: This paper considers a new policy and statistical framework for evaluating K-12 schools and policies that aligns with diversity, equity, and inclusion values. The new approach broadens the standard approach to accountability and evaluation by combining features of evaluation and multi-level growth models with approaches used in systems…
Descriptors: Accountability, Inclusion, Diversity, Equal Education
Chan, Wendy – Journal of Research on Educational Effectiveness, 2017
Recent methods to improve generalizations from nonrandom samples typically invoke assumptions such as the strong ignorability of sample selection, which is challenging to meet in practice. Although researchers acknowledge the difficulty in meeting this assumption, point estimates are still provided and used without considering alternative…
Descriptors: Generalization, Inferences, Probability, Educational Research
Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016
Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…
Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas
Halpin, Peter F. – Society for Research on Educational Effectiveness, 2016
Recent research on multiple measures of teaching effectiveness has redefined the role of in-classroom observations in teacher evaluation systems. In particular, most states now mandate that teachers are observed on multiple occasions during the school year, and it is increasingly common that multiple raters are utilized across the different rating…
Descriptors: Models, Multivariate Analysis, Scoring Rubrics, Teacher Evaluation
Ho, Shyue-Yung; Chen, Wen-Te; Hsu, Wei-Ling – EURASIA Journal of Mathematics, Science & Technology Education, 2017
Environmental education is essential for people to pursue sustainable development. In Taiwan, environmental education is taught to students until they graduate from junior high school. This study was conducted to establish an assessment system for junior high schools to select appropriate environmental education facilities and sites. A mix of…
Descriptors: Junior High Schools, Foreign Countries, Environmental Education, Statistical Analysis
Cyril, A. Vences; Jeyasekaran, D. – Journal on Educational Psychology, 2016
Continuous and Comprehensive Evaluation (CCE) refers to a system of school-based evaluation introduced by CBSE in all CBSE affiliated schools across the country to evaluate both scholastic and non-scholastic aspects of students' growth and development. Continuous and comprehensive evaluation is to evaluate every aspect of the child during their…
Descriptors: Student Evaluation, Attitude Measures, Student Attitudes, Foreign Countries
Won, Mihye; Krabbe, Heiko; Ley, Siv Ling; Treagust, David F.; Fischer, Hans E. – Educational Assessment, 2017
In this study, we investigated the value of a concept map marking guide as an alternative formative assessment tool for science teachers to adopt for the topic of energy. Eight high school science teachers marked students' concept maps using an itemized holistic marking guide. Their marking was compared with the researchers' marking and the scores…
Descriptors: Science Teachers, Science Instruction, Concept Mapping, Formative Evaluation
Tidén, Anna; Lundqvist, Carolina; Nyberg, Marie – Measurement in Physical Education and Exercise Science, 2015
This study presents the development process and initial validation of the NyTid test, a process-oriented movement assessment tool for compulsory school pupils. A sample of 1,260 (627 girls and 633 boys; mean age of 14.39) Swedish school children participated in the study. In the first step, exploratory factor analyses (EFAs) were performed in…
Descriptors: Test Construction, Test Validity, Psychomotor Skills, Student Evaluation
González-Brenes, José P.; Huang, Yun – International Educational Data Mining Society, 2015
Classification evaluation metrics are often used to evaluate adaptive tutoring systems (programs that teach and adapt to humans). Unfortunately, it is not clear how intuitive these metrics are for practitioners with little machine learning background. Moreover, our experiments suggest that existing convention for evaluating tutoring systems may…
Descriptors: Intelligent Tutoring Systems, Evaluation Methods, Program Evaluation, Student Behavior
Lombardi, Doug; Brandt, Carol B.; Bickel, Elliot S.; Burg, Colin – International Journal of Science Education, 2016
Scientists regularly evaluate alternative explanations of phenomena and solutions to problems. Students should similarly engage in critical evaluation when learning about scientific and engineering topics. However, students do not often demonstrate sophisticated evaluation skills in the classroom. The purpose of the present study was to…
Descriptors: Climate, Student Attitudes, Middle School Students, Controversial Issues (Course Content)
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Tseng, Jun-Jie – Computer Assisted Language Learning, 2016
Researchers have been keen to develop instruments for the assessment of teachers' self-perceived technological pedagogical content knowledge (TPACK); however, few studies have been conducted to validate such assessment tools through students' perspectives in the context of English as a foreign language (EFL). The purpose of this study was thus to…
Descriptors: English (Second Language), Second Language Learning, Pedagogical Content Knowledge, Educational Technology
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores