Peter Organisciak; Selcuk Acar; Denis Dumas; Kelly Berthiaume – Grantee Submission, 2023
Automated scoring for divergent thinking (DT) seeks to overcome a key obstacle to creativity measurement: the effort, cost, and reliability of scoring open-ended tests. For a common test of DT, the Alternate Uses Task (AUT), the primary automated approach casts the problem as a semantic distance between a prompt and the resulting idea in a text…
Descriptors: Automation, Computer Assisted Testing, Scoring, Creative Thinking
Panadero, Ernesto; Jonsson, Anders; Pinedo, Leire; Fernández-Castilla, Belén – Educational Psychology Review, 2023
Rubrics are widely used as instructional and learning instruments. Though they have been claimed to have positive effects on students' learning, these effects have not been meta-analyzed. Our aim was to synthesize the effects of rubrics on academic performance, self-regulated learning, and self-efficacy. The moderator effect of the following…
Descriptors: Scoring Rubrics, Academic Achievement, Self Management, Learning Strategies
Gerard, Libby; Kidron, Ady; Linn, Marcia C. – International Journal of Computer-Supported Collaborative Learning, 2019
This paper illustrates how the combination of teacher and computer guidance can strengthen collaborative revision and identifies opportunities for teacher guidance in a computer-supported collaborative learning environment. We took advantage of natural language processing tools embedded in an online, collaborative environment to automatically…
Descriptors: Computer Assisted Testing, Student Evaluation, Science Tests, Scoring
Zhang, Haoran; Litman, Diane – Grantee Submission, 2017
Manually grading the Response to Text Assessment (RTA) is labor intensive. Therefore, an automatic method is being developed for scoring analytical writing when the RTA is administered in large numbers of classrooms. Our long-term goal is to also use this scoring method to provide formative feedback to students and teachers about students' writing…
Descriptors: Automation, Scoring, Evidence, Scoring Rubrics
Lichtenstein, Robert – Communique, 2020
A neuropsychologist describes a child's performance on a measure of short-term verbal memory as falling in the low average range. Another neuropsychologist reports that a child scored in the below average range. A third neuropsychologist describes a child's performance as mildly impaired. Yet, all three are referring to the same score on the same…
Descriptors: Scores, Neuropsychology, Short Term Memory, Tests
Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020
Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…
Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries
Hauk, Shandy; Kaser, Joyce – American Journal of Evaluation, 2020
This brief report describes the conception, development, and use of a rubric in evaluating the feasibility of a new program. The evaluators searched for a meta-analytic tool to help organize ideas about what data to collect, and why, in order to create a detailed story of feasibility of implementation for the client. The main advantage of using…
Descriptors: Scoring Rubrics, Program Implementation, Program Evaluation, Feasibility Studies
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Katy Dyson; Laura Piestrzynski – Dimensions of Early Childhood, 2025
Emergent writing--the process where young children begin to experiment with written language--is an important contributor to the development of literacy skills. One way for teachers to support the development of writing skills in preschool-aged children is by integrating the Classroom Assessment Scoring System (CLASS) as a framework to foster…
Descriptors: Writing Instruction, Teaching Methods, Beginning Writing, Preschool Children
Jocelyn A. Gutierrez; Nikola Grafnetterova – Journal of Latinos and Education, 2025
Hispanic-Serving Institutions (HSIs) play an important role in educating Latinos. The purpose of this article is to apply and extend the Typology of Hispanic-Serving Institutions Organizational Identities framework and propose a systematic rubric as a tool for assessment and evaluation. The article contains analyses and discourse for HSIs…
Descriptors: Minority Serving Institutions, Undergraduate Students, Hispanic American Students, Institutional Characteristics
Edyburn, Keith; Edyburn, Dave L. – Intervention in School and Clinic, 2021
In the fairy tale "Goldilocks," a young girl enters the home of three bears. As she explores the porridge, chairs, and beds, in each situation she is seeking what is "just right." It seems that Goldilocks is the perfect metaphor for describing learners experiencing universal design for learning (UDL) because it highlights the…
Descriptors: Access to Education, Teacher Role, Educational Resources, Educational Technology
Crecelius, Anne R.; DeRuisseau, Lara R.; Brandauer, Josef – Advances in Physiology Education, 2021
Assessment methods vary widely across undergraduate physiology courses. Here, a cumulative oral examination was administered in two sections of a 300-level undergraduate physiology course. Student performance was quantified via instructor grading using a rubric, and self-perceptions (n = 55) were collected via survey. Overall, students affirmed…
Descriptors: Verbal Tests, Undergraduate Students, Physiology, Student Attitudes

