Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 12 |
| Since 2007 (last 20 years) | 43 |
Descriptor
| Correlation | 50 |
| Reliability | 50 |
| Scoring | 36 |
| Validity | 25 |
| Comparative Analysis | 15 |
| Foreign Countries | 15 |
| Scoring Rubrics | 12 |
| Evaluators | 11 |
| Scores | 11 |
| Computer Assisted Testing | 10 |
| Statistical Analysis | 8 |
| More ▼ | |
Source
Author
| Attali, Yigal | 2 |
| Clauser, Brian E. | 2 |
| Simper, Natalie | 2 |
| Abdul Gafoor, K. | 1 |
| Akkoyunlu, Buket | 1 |
| Allan S. Cohen | 1 |
| Alsardary, Salar | 1 |
| Amanda Huee-Ping Wong | 1 |
| Andersson, Marie | 1 |
| Apple, Kristen | 1 |
| Baldwin, Peter | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 40 |
| Journal Articles | 39 |
| Dissertations/Theses -… | 3 |
| Reports - Evaluative | 3 |
| Speeches/Meeting Papers | 3 |
| Reports - Descriptive | 2 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 12 |
| Postsecondary Education | 8 |
| Secondary Education | 8 |
| Elementary Secondary Education | 3 |
| Junior High Schools | 3 |
| Elementary Education | 2 |
| High Schools | 2 |
| Middle Schools | 2 |
| Grade 3 | 1 |
| Grade 5 | 1 |
| Grade 7 | 1 |
| More ▼ | |
Audience
Location
| Canada | 3 |
| China | 3 |
| Turkey | 2 |
| Australia | 1 |
| California | 1 |
| Colorado | 1 |
| Florida | 1 |
| Georgia | 1 |
| India | 1 |
| Nigeria | 1 |
| North Carolina (Greensboro) | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Conti, Gary J. – Journal of Education and Learning, 2023
The use of personality inventories has been limited because of their cost and the length. To overcome these limitations, this study created the Personality Identity Estimator (PIE), an easy-to-use inventory to estimate personality types that can be used at no cost. PIE is a categorical inventory containing 12 items with 3 items for each of the 4…
Descriptors: Personality Measures, Personality Traits, Validity, Reliability
Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024
Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…
Descriptors: Semantics, Educational Assessment, Evaluators, Reliability
Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022
Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…
Descriptors: Semantics, Scoring, Creative Thinking, Creativity
Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…
Descriptors: Validity, Discourse Analysis, Databases, Scoring
Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018
Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…
Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring
Ebuoh, Casmir N. – World Journal of Education, 2018
Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable and this unreliability is more likely to be more in internal examinations than in the external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…
Descriptors: Holistic Approach, Scoring, Essay Tests, Biology
Simper, Natalie – Teaching & Learning Inquiry, 2018
This paper explores a method to support instructors in assessing cognitive skills in their course, designed to enable aggregation of data across an institution. A rubric authoring tool, "BASICS" (Building Assessment Scaffolds for Intellectual Cognitive Skills) was built as part of the Queen's University Learning Outcomes Assessment (LOA)…
Descriptors: Scoring Rubrics, Thinking Skills, Foreign Countries, College Outcomes Assessment
Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013
Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…
Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests
Nuhoglu Kibar, Pinar; Akkoyunlu, Buket – Journal of Visual Literacy, 2017
In this ever more digital and visual world, it has become more vital that students are encouraged to create content during the learning process through effective visualization of their knowledge. Infographics are an effective method for such visualization. The current study therefore proposes an infographic design rubric (IDR) as a criteria-based…
Descriptors: Visual Aids, Design, Visual Literacy, Visualization
Mann, Aaron; de Bruin, Angela – International Journal of Bilingual Education and Bilingualism, 2022
Bilingualism is a multi-faceted experience and bilinguals differ in how they use their languages in daily life. Therefore, assessments of bilingualism that consider the role of (social) context are needed when describing bilinguals. In this study, we evaluated how (reliably) the Language and Social Background Questionnaire (LSBQ; Anderson et al.…
Descriptors: Bilingualism, Foreign Countries, Native Language, Second Language Learning
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Vázquez-Alonso, Ángel; Manassero-Mas, María-Antonia; García-Carmona, Antonio; Montesano de Talavera, Marisa – Asia-Pacific Forum on Science Learning and Teaching, 2016
This study applies a new quantitative methodological approach to diagnose epistemology conceptions in a large sample. The analyses use seven multiple-rating items on the epistemology of science drawn from the item pool Views on Science-Technology-Society (VOSTS). The bases of the new methodological diagnostic approach are the empirical…
Descriptors: Epistemology, Statistical Analysis, Science and Society, Scientific Principles
Dan, Youngjun; Geng, Leisha; Li, Meng – Education, 2017
This study aimed to explore students' cognitive patterns based on their knowledge and levels. Participants were seventh graders from a junior high school in China. Three relatively distinct groups were specified by Cluster Analysis: high knowledge and low ability, low knowledge and low ability, and high knowledge and high ability. The group of low…
Descriptors: Cognitive Structures, Curriculum Design, Teaching Methods, Junior High School Students
Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014
The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…
Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Peer reviewed
Direct link
