NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1 to 15 of 2,069 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Siti Suprihatiningsih; Masriyah; Rooselyna Ekawati – Journal of Education and Learning (EduLearn), 2025
The knowledge of the materials to be taught to the students is the basic knowledge that preservice mathematics teachers should possess, as they need to prepare themselves for teaching. In order to research preservice teachers' understanding of the subject matter and teaching skils, valid and reliable test instruments are required. Knowledge of…
Descriptors: Preservice Teachers, Pedagogical Content Knowledge, Preservice Teacher Education, Mathematics Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Lisa DaVia Rubenstein; Kathrin Maki; Brianna Quigley; Shanyn Thompson; Lisa M. Ridgley Smith – AERA Online Paper Repository, 2024
The purpose of this systematic review was to survey available measures of creativity for pk12 students for assessments characteristics and reporting of psychometric properties. Using the PRISMA framework, we identified 42 unique articles with 48 assessments meeting our inclusion criteria. Then, two coders independently coded all articles using a…
Descriptors: Literature Reviews, Meta Analysis, Elementary Secondary Education, Creativity
Peer reviewed Peer reviewed
Direct linkDirect link
Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024
Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…
Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits
Peer reviewed Peer reviewed
Direct linkDirect link
Yue Huang; Joshua Wilson – Journal of Computer Assisted Learning, 2025
Background: Automated writing evaluation (AWE) systems, used as formative assessment tools in writing classrooms, are promising for enhancing instruction and improving student performance. Although meta-analytic evidence supports AWE's effectiveness in various contexts, research on its effectiveness in the U.S. K-12 setting has lagged behind its…
Descriptors: Writing Evaluation, Writing Skills, Writing Tests, Writing Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Dirk Gellermann; Hanno Michel; Ute Harms – Mind, Brain, and Education, 2025
In order for climate literacy assessments to be applicable in large-scale studies, it is essential that they comply with the standards of test administration while maintaining consistency with a comprehensive definition of the concept. In alignment with the different educational frameworks and the Climate Literacy Principles of the U.S. Global…
Descriptors: Climate, Environmental Education, Literacy, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sümeyye Arkan; Sema Tan – International Journal of Assessment Tools in Education, 2025
Teachers' perceptions, attitudes, and opinions about students, curricula, or evaluation methods contribute to the development of students' talents. Thus, researchers often collect data from teachers to identify gifted students, determine educational practices to meet the students' needs and assess gifted education programs. Researchers often…
Descriptors: Talent Identification, Academically Gifted, Evaluation Methods, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Marilena Z. Leana-Tascilar – Cogent Education, 2024
This study aimed to develop a comprehensive tool to assess underachievement in gifted students, incorporating input from parents, teachers, and students themselves. A total of 285 participants, including 95 gifted students, their parents, and teachers, were involved in the study. The results have revealed a four-factor structure for the Gifted…
Descriptors: Psychometrics, Academic Achievement, Underachievement, Academically Gifted
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Marianne Berg Halvorsen; Arvid Nikolai Kildahl; Sabine Kaiser; Brynhildur Axelsdottir; Michael G. Aman; Sissel Berge Helverschou – Journal of Autism and Developmental Disorders, 2025
In recent years, there has been a proliferation of instruments for assessing mental health (MH) among autistic people. This study aimed to review the psychometric properties of broadband instruments used to assess MH problems among autistic people. In accordance with the PRISMA guidelines (PROSPERO: CRD42022316571) we searched the APA PsycINFO via…
Descriptors: Psychometrics, Mental Health, Clinical Diagnosis, Evaluation Methods
Katie L. McDermott – ProQuest LLC, 2024
Nursing education programs are faced with urgent demands to transition to competency-based education (CBE) to address the limitations of the nursing workforce. The AACN (2021) has developed the Essentials, or the core competencies for graduating entry- and advanced-level nurses to inform CBE. A concept analysis of Foundational Competence was…
Descriptors: Job Skills, Employment Qualifications, Nurses, Nursing Education
Peer reviewed Peer reviewed
Direct linkDirect link
Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025
Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality persists to be a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…
Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation
Paul Alexander Siegel – ProQuest LLC, 2024
While multimodality and multiliteracies has been a concept for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concept within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…
Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  138