Publication Date
In 2025 | 46 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 380 |
Since 2016 (last 10 years) | 694 |
Since 2006 (last 20 years) | 1749 |
Descriptor
Evaluation Methods | 1749 |
Computer Assisted Testing | 812 |
Student Evaluation | 677 |
Foreign Countries | 519 |
Testing | 323 |
Educational Assessment | 264 |
Educational Technology | 261 |
Scores | 223 |
Educational Testing | 217 |
Comparative Analysis | 201 |
Academic Achievement | 200 |
Author
Tindal, Gerald | 10 |
Alonzo, Julie | 9 |
Lai, Cheng Fei | 7 |
Hwang, Gwo-Jen | 6 |
Thurlow, Martha L. | 6 |
Newhouse, C. Paul | 5 |
Sinharay, Sandip | 5 |
van der Linden, Wim J. | 5 |
Bridgeman, Brent | 4 |
Davey, Tim | 4 |
Liu, Kristin K. | 4 |
Audience
Teachers | 38 |
Administrators | 17 |
Researchers | 17 |
Practitioners | 15 |
Policymakers | 7 |
Support Staff | 6 |
Counselors | 5 |
Students | 5 |
Location
United Kingdom | 61 |
Australia | 58 |
United States | 35 |
Germany | 30 |
Turkey | 29 |
Canada | 26 |
Florida | 26 |
Spain | 23 |
California | 22 |
South Africa | 22 |
China | 21 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 3 |
Jeff Coon; Paulina N. Silva; Alexander Etz; Barbara W. Sarnecka – Journal of Cognition and Development, 2025
Bayesian methods offer many advantages when applied to psychological research, yet they may seem esoteric to researchers who are accustomed to traditional methods. This paper aims to lower the barrier of entry for developmental psychologists who are interested in using Bayesian methods. We provide worked examples of how to analyze common study…
Descriptors: Developmental Psychology, Bayesian Statistics, Research Methodology, Psychological Studies
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Mücahit Öztürk – Open Praxis, 2024
This study examined the problems that pre-service teachers face in the online assessment process and their suggestions for solutions to these problems. The participants were 136 pre-service teachers who have been experiencing online assessment for a long time and who took the Foundations of Open and Distance Learning course. This research is a…
Descriptors: Foreign Countries, Preservice Teacher Education, Preservice Teachers, Distance Education
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Alper Gülay; Emre Cumali; Damla Cumali – International Journal of Contemporary Educational Research, 2024
This qualitative phenomenological study explores the experiences of parents of children with special needs in Turkey, specifically their encounters with Guidance and Research Centers (GRCs) during the process of obtaining educational assessment reports. Through semi-structured interviews with 25 parents, the study reveals complex emotions and…
Descriptors: Foreign Countries, Special Needs Students, Parent Attitudes, Parent Participation
Hacer Karamese – ProQuest LLC, 2022
Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…
Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design
Jolanta Kisielewska; Paul Millin; Neil Rice; Jose Miguel Pego; Steven Burr; Michal Nowakowski; Thomas Gale – Education and Information Technologies, 2024
Between 2018-2021, eight European medical schools took part in a study to develop a medical knowledge Online Adaptive International Progress Test. Here we discuss participants' self-perception to evaluate the acceptability of adaptive vs non-adaptive testing. Study participants, students from across Europe at all stages of undergraduate medical…
Descriptors: Medical Students, Medical Education, Student Attitudes, Self Efficacy
Baryktabasov, Kasym; Jumabaeva, Chinara; Brimkulov, Ulan – Research in Learning Technology, 2023
Many examinations with thousands of participating students are organized worldwide every year. Usually, this large number of students sit the exams simultaneously and answer almost the same set of questions. This method of learning assessment requires tremendous effort and resources to prepare the venues, print question books and organize the…
Descriptors: Information Technology, Computer Assisted Testing, Test Items, Adaptive Testing
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Martin Braun – New Directions in the Teaching of Natural Sciences, 2024
During the COVID pandemic, universities around the globe had to move not only their content delivery online, but also their assessments. Due to COVID causing significant upheaval in Higher Education (HE), this enforced experiment also afforded an opportunity to reflect on traditional, invigilated, closed book exams (ICBE) resulting in research and…
Descriptors: COVID-19, Pandemics, Computer Assisted Testing, Educational Technology
Zebo Xu; Prerit S. Mittal; Mohd. Mohsin Ahmed; Chandranath Adak; Zhenguang G. Cai – Reading and Writing: An Interdisciplinary Journal, 2025
The rise of the digital era has led to a decline in handwriting as the primary mode of communication, resulting in negative effects on handwriting literacy, particularly in complex writing systems such as Chinese. The marginalization of handwriting has contributed to the deterioration of penmanship, defined as the ability to write aesthetically…
Descriptors: Handwriting, Writing Skills, Chinese, Ideography
Wagner, Inga; Loesche, Philipp; Bißantz, Steven – European Journal of Psychology of Education, 2022
The German school system employs centrally organized performance assessments (some of which are called "VERA") as a way of promoting lesson development. In recent years, several German federal states introduced a computer-based performance testing system which will replace the paper-pencil testing system in the future. Scores from…
Descriptors: Foreign Countries, Computer Assisted Testing, Testing, Evaluation Methods
Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023
In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…
Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis
Ebru Balta; Celal Deha Dogan – SAGE Open, 2024
As computer-based testing becomes more prevalent, the attention paid to response time (RT) in assessment practice and psychometric research correspondingly increases. This study explores the rate of Type I error in detecting preknowledge cheating behaviors, the power of the Kullback-Leibler (KL) divergence measure, and the L person fit statistic…
Descriptors: Cheating, Accuracy, Reaction Time, Computer Assisted Testing
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing