NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 241 to 255 of 10,081 results Save | Export
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle – Center for Educational Measurement and Evaluation, 2022
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Evaluators, Rating Scales, Teacher Evaluation
Hacer Karamese – ProQuest LLC, 2022
Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…
Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design
Peer reviewed Peer reviewed
Direct linkDirect link
Peter Howell; Clarissa Sorger; Roa'a Alsulaiman; Kaho Yoshikawa; John Harris; Kevin Tang – International Journal of Language & Communication Disorders, 2024
Background: Non-word repetition (NWR) tests are an important way speech and language therapists (SaLTs) assess language development. NWR tests are often scored whilst participants make their responses (i.e., in real time) in clinical and research reports (documented here via a secondary analysis of a published systematic review). Aims: The main…
Descriptors: Language Tests, Scoring, Accuracy, Children
Peer reviewed Peer reviewed
Direct linkDirect link
Ahtsham U. Niazi; Alayne Kealey; Stephen Choi; Lilia Kaustov; Jordan Tarshis – Discover Education, 2024
To improve clinical teaching skills, feedback on teachers' strengths and weaknesses needs to be reliable, timely, and relevant. To provide timely feedback we undertook development of an analytical dashboard to provide learner feedback to our faculty. As dashboard data displays are limited we performed a modified Delphi (mDelphi) method to…
Descriptors: Teacher Effectiveness, Teacher Improvement, Feedback (Response), Preferences
Peer reviewed Peer reviewed
Direct linkDirect link
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Aykut Çitçi; Fatih Kezer – International Journal of Assessment Tools in Education, 2024
This study investigates the application of the fuzzy logic method for scoring open-ended items, specifically comparing its effectiveness against traditional scoring methods. Utilizing the fuzzy TOPSIS method within the mathematics domain, this research established seven criteria for evaluating open-ended responses, developed in consultation with…
Descriptors: Foreign Countries, Students, Mathematics Instruction, Scoring Rubrics
Peer reviewed Peer reviewed
Direct linkDirect link
Richard McInnes; James E. Hobson; Kerry Lorette Johnson; Joshua Cramp; Claire Aitchison; Katherine L. Baldock – Australasian Journal of Educational Technology, 2024
How do we make judgements about the quality of online courses? Checklists and rubrics are commonplace in higher education for establishing and measuring design features of online courses. They are created and used by institutions, academics and educational designers to standardise measures for quality online course design. Despite an intensifying…
Descriptors: Educational Quality, Online Courses, Check Lists, Scoring Rubrics
Peer reviewed Peer reviewed
Direct linkDirect link
Chahna Gonsalves – Assessment & Evaluation in Higher Education, 2024
Despite their widespread adoption and recognised benefits, rubrics have been critiqued for their potential misalignment with student needs. The voices of international students, who constitute a substantial portion of the higher education population, remain underrepresented. This study examines the perspectives of international undergraduate…
Descriptors: Foreign Countries, Democratic Values, Scoring Rubrics, Foreign Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Deborah Oluwadele; Yashik Singh; Timothy Adeliyi – Electronic Journal of e-Learning, 2024
Validation is needed for any newly developed model or framework because it requires several real-life applications. The investment made into e-learning in medical education is daunting, as is the expectation for a positive return on investment. The medical education domain requires data-wise implementation of e-learning as the debate continues…
Descriptors: Electronic Learning, Evaluation Methods, Medical Education, Sustainability
Peer reviewed Peer reviewed
Direct linkDirect link
Rebecka Weegar; Peter Idestam-Almquist – International Journal of Artificial Intelligence in Education, 2024
Machine learning methods can be used to reduce the manual workload in exam grading, making it possible for teachers to spend more time on other tasks. However, when it comes to grading exams, fully eliminating manual work is not yet possible even with very accurate automated grading, as any grading mistakes could have significant consequences for…
Descriptors: Grading, Computer Assisted Testing, Introductory Courses, Computer Science Education
Peer reviewed Peer reviewed
Direct linkDirect link
Liang Liao – Teaching in Higher Education, 2024
This study explores how assessment criteria are applied in grading student work. It is found that explicit assessment criteria do not work as authoritative guidance as expected and that tacit criteria are more decisive in awarding a certain grade. Various sources that form idiosyncratic tacit criteria are identified. These sources, including…
Descriptors: Student Evaluation, Grading, Criterion Referenced Tests, Criteria
Peer reviewed Peer reviewed
Direct linkDirect link
Pui-Kwan Au; Calvin Kai-Ching Yu; Siu-Sing Wong – Art Therapy: Journal of the American Art Therapy Association, 2024
The Person-in-the-Rain (PITR) drawing scoring system primarily assesses stress, excluding consideration of color usage. In contrast, the Formal Elements Art Therapy Scale (FEATS) effectively evaluates psychopathological disorders and provides a comprehensive assessment of color usage. This study aimed to: (1) develop an alternative scoring system…
Descriptors: Foreign Countries, College Students, Art Therapy, Behavior Rating Scales
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ebru Öztürk; Erol Duran – Educational Policy Analysis and Strategic Research, 2024
In this study, it was aimed to develop a rubric to evaluate the creative story writing skill levels of seventh grade secondary school students. The research was designed in quantitative research method and survey model. In the research, convenience sampling technique was used and 270 students studying at the seventh grade level of secondary school…
Descriptors: Scoring Rubrics, Writing Evaluation, Creative Writing, Middle School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024
Automatic essay scoring (AES) is an essential educational application in natural language processing. This automated process will alleviate the burden by increasing the reliability and consistency of the assessment. With the advances in text embedding libraries and neural network models, AES systems achieved good results in terms of accuracy.…
Descriptors: Scoring, Essays, Writing Evaluation, Memory
Peer reviewed Peer reviewed
Direct linkDirect link
Reagan Mozer; Luke Miratrix; Jackie Relyea; Jimmy Kim – Society for Research on Educational Effectiveness, 2021
Background: In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome.…
Descriptors: Elementary School Students, Grade 1, Grade 2, Science Education
Pages: 1  |  ...  |  13  |  14  |  15  |  16  |  17  |  18  |  19  |  20  |  21  |  ...  |  673