Publication Date
In 2025 | 6 |
Since 2024 | 14 |
Since 2021 (last 5 years) | 26 |
Since 2016 (last 10 years) | 46 |
Since 2006 (last 20 years) | 101 |
Descriptor
Evaluation Methods | 281 |
Test Reliability | 281 |
Test Validity | 182 |
Student Evaluation | 93 |
Testing | 89 |
Test Construction | 74 |
Testing Problems | 69 |
Computer Assisted Testing | 60 |
Foreign Countries | 46 |
Higher Education | 39 |
Elementary Secondary Education | 36 |
More ▼ |
Source
Author
Bagnato, Stephen J. | 2 |
Booker, Kevin | 2 |
Boyle, Michael H. | 2 |
Bruch, Julie | 2 |
Cunningham, Charles E. | 2 |
Gill, Brian | 2 |
Koretz, Daniel | 2 |
Macy, Marisa | 2 |
Pettingill, Peter | 2 |
Thurlow, Martha L. | 2 |
ANDERSON, JAMES A. | 1 |
More ▼ |
Publication Type
Education Level
Location
United Kingdom | 6 |
Turkey | 5 |
Germany | 4 |
California | 3 |
Canada | 3 |
China | 3 |
Israel | 3 |
Japan | 3 |
United Kingdom (England) | 3 |
Australia | 2 |
Florida | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Mücahit Öztürk – Open Praxis, 2024
This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…
Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Alper Gülay; Emre Cumali; Damla Cumali – International Journal of Contemporary Educational Research, 2024
This qualitative phenomenological study explores the experiences of parents of children with special needs in Turkey, specifically their encounters with Guidance and Research Centers (GRCs) during the process of obtaining educational assessment reports. Through semi-structured interviews with 25 parents, the study reveals complex emotions and…
Descriptors: Foreign Countries, Special Needs Students, Parent Attitudes, Parent Participation
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
Delphine Franco; Ruben Vanderlinde; Martin Valcke – European Journal of Education, 2025
Complex competences, such as managing students' aggressive behaviour, are challenging to develop during teacher training. Recently, video-based simulations have been considered promising, yet suitable assessment instruments are limitedly available. This paper reports on the design and evaluation of a video-based assessment tool tailored to measure…
Descriptors: Preservice Teachers, Preservice Teacher Education, Student Behavior, Aggression
Ole J. Kemi – Advances in Physiology Education, 2025
Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…
Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Uzun, N. Bilge; Aktas, Mehtap; Akay, Cenk – Journal of Educational Technology, 2023
The challenges experienced in measurement and evaluation during the distance education process among student and instructor groups are discussed in the study. A qualitative meta-synthesis method is used in this research. Twenty studies were included in the meta-synthesis. The challenges experienced by the instructors are program utilization,…
Descriptors: Measurement Techniques, Evaluation Methods, Distance Education, Literature Reviews
Duncan Culbreth; Rebekah Davis; Cigdem Meral; Florence Martin; Weichao Wang; Sejal Foxx – TechTrends: Linking Research and Practice to Improve Learning, 2025
Monitoring applications (MAs) use digital and online tools to collect and track data on student behavior, and they have become increasingly popular among schools. Empirical research on these complex surveillance platforms is scant, and little is known about the efficacy or impact that they have on students. This study used a multi-method…
Descriptors: High School Students, COVID-19, Pandemics, Progress Monitoring
Mansooreh Hosseinnia; Zahra Kafi – Language Testing in Asia, 2024
As testing involves various aspects of education as well as the ones who are involved like instructors, students, managers, teacher trainers, testers, and decision-makers, it comes to be highly crucial to develop ethical tests. In addition, as some methods of testing are more favored and practiced compared to others without considering the ethical…
Descriptors: Test Construction, Test Validity, Ethics, Testing
Toker, Turker – International Journal of Curriculum and Instruction, 2023
Achievement tests are among the most widely used data collection tools to measure the knowledge and skill levels of individuals. For this reason, the existence of valid and reliable achievement tests that can perfectly reveal the competencies that a person should have in any discipline is of great importance. The purpose of this research is to…
Descriptors: Basic Skills, Evaluation Methods, Test Items, Test Validity
Kübra Karakaya Özyer – Journal of Social Studies Education Research, 2024
The study aims to assess online assessment practices in a public university, addressing questions about self-efficacy levels, tools used, challenges faced, and proposed solutions. The chosen methodology employs a cross-sectional survey design, collecting both quantitative and qualitative data from 50 instructors in Türkiye through a convenience…
Descriptors: Foreign Countries, Student Evaluation, Computer Assisted Testing, College Students
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education