ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	6
Since 2017 (last 10 years)	12
Since 2007 (last 20 years)	43

Descriptor

Correlation	50
Reliability	50
Scoring	36
Validity	25
Comparative Analysis	15
Foreign Countries	15
Scoring Rubrics	12
Evaluators	11
Scores	11
Computer Assisted Testing	10
Statistical Analysis	8
Factor Analysis	7
Writing Evaluation	7
Accuracy	6
College Students	6
English (Second Language)	6
Essays	6
Psychometrics	6
Second Language Learning	6
Writing Tests	6
Evaluation Methods	5
Questionnaires	5
Responses	5
Vocabulary	5
Decision Making	4
More ▼

Publication Type

Reports - Research	40
Journal Articles	39
Dissertations/Theses -…	3
Reports - Evaluative	3
Speeches/Meeting Papers	3
Reports - Descriptive	2
Tests/Questionnaires	1

Education Level

Higher Education	12
Postsecondary Education	8
Secondary Education	8
Elementary Secondary Education	3
Junior High Schools	3
Elementary Education	2
High Schools	2
Middle Schools	2
Grade 3	1
Grade 5	1
Grade 7	1
Grade 8	1
More ▼

Audience

Location

Canada	3
China	3
Turkey	2
Australia	1
California	1
Colorado	1
Florida	1
Georgia	1
India	1
Nigeria	1
North Carolina (Greensboro)	1
Panama	1
Pennsylvania	1
Pennsylvania (Pittsburgh)	1
Singapore	1
Sweden	1
United Kingdom	1
United States	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

United States Medical…	3
Graduate Record Examinations	2
Test of English as a Foreign…	2
Goodenough Harris Drawing Test	1
Learning and Study Strategies…	1
Motivated Strategies for…	1
Myers Briggs Type Indicator	1
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 50 results Save | Export

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Development and Validation of a Short-Form Inventory to Identify Personality Types: The Personality Identity Estimator (PIE)

Peer reviewed
PDF on ERIC

Download full text

Conti, Gary J. – Journal of Education and Learning, 2023

The use of personality inventories has been limited because of their cost and the length. To overcome these limitations, this study created the Personality Identity Estimator (PIE), an easy-to-use inventory to estimate personality types that can be used at no cost. PIE is a categorical inventory containing 12 items with 3 items for each of the 4…

Descriptors: Personality Measures, Personality Traits, Validity, Reliability

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Semantic Distance and the Alternate Uses Task: Recommendations for Reliable Automated Assessment of Originality

Peer reviewed

Direct link

Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022

Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…

Descriptors: Semantics, Scoring, Creative Thinking, Creativity

Validation of an Automated Procedure for Calculating Core Lexicon from Transcripts

Peer reviewed

Direct link

Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…

Descriptors: Validity, Discourse Analysis, Databases, Scoring

The Influence of Rater Effects in Training Sets on the Psychometric Quality of Automated Scoring for Writing Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Wolfe, Edward W.; Engelhard, George, Jr.; Foltz, Peter; Rosenstein, Mark – International Journal of Testing, 2018

Automated essay scoring engines (AESEs) are becoming increasingly popular as an efficient method for performance assessments in writing, including many language assessments that are used worldwide. Before they can be used operationally, AESEs must be "trained" using machine-learning techniques that incorporate human ratings. However, the…

Descriptors: Computer Assisted Testing, Essay Tests, Writing Evaluation, Scoring

Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

Peer reviewed
PDF on ERIC

Download full text

Ebuoh, Casmir N. – World Journal of Education, 2018

Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable and this unreliability is more likely to be more in internal examinations than in the external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…

Descriptors: Holistic Approach, Scoring, Essay Tests, Biology

Rubric Authoring Tool Supporting Cognitive Skills Assessment across an Institution

Peer reviewed
PDF on ERIC

Download full text

Simper, Natalie – Teaching & Learning Inquiry, 2018

This paper explores a method to support instructors in assessing cognitive skills in their course, designed to enable aggregation of data across an institution. A rubric authoring tool, "BASICS" (Building Assessment Scaffolds for Intellectual Cognitive Skills) was built as part of the Queen's University Learning Outcomes Assessment (LOA)…

Descriptors: Scoring Rubrics, Thinking Skills, Foreign Countries, College Outcomes Assessment

Scoring with the Computer: Alternative Procedures for Improving the Reliability of Holistic Essay Scoring

Peer reviewed

Direct link

Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013

Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…

Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests

Fostering and Assessing Infographic Design for Learning: The Development of Infographic Design Criteria

Peer reviewed

Direct link

Nuhoglu Kibar, Pinar; Akkoyunlu, Buket – Journal of Visual Literacy, 2017

In this ever more digital and visual world, it has become more vital that students are encouraged to create content during the learning process through effective visualization of their knowledge. Infographics are an effective method for such visualization. The current study therefore proposes an infographic design rubric (IDR) as a criteria-based…

Descriptors: Visual Aids, Design, Visual Literacy, Visualization

Bilingual Language Use Is Context Dependent: Using the Language and Social Background Questionnaire to Assess Language Experiences and Test-Rest Reliability

Peer reviewed

Direct link

Mann, Aaron; de Bruin, Angela – International Journal of Bilingual Education and Bilingualism, 2022

Bilingualism is a multi-faceted experience and bilinguals differ in how they use their languages in daily life. Therefore, assessments of bilingualism that consider the role of (social) context are needed when describing bilinguals. In this study, we evaluated how (reliably) the Language and Social Background Questionnaire (LSBQ; Anderson et al.…

Descriptors: Bilingualism, Foreign Countries, Native Language, Second Language Learning

Evaluating Comparative Judgment as an Approach to Essay Scoring

Peer reviewed

Direct link

Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016

As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…

Descriptors: Essays, Scoring, Comparative Analysis, Evaluators

Diagnosing Conceptions about the Epistemology of Science: Contributions of a Quantitative Assessment Methodology

Peer reviewed

Direct link

Vázquez-Alonso, Ángel; Manassero-Mas, María-Antonia; García-Carmona, Antonio; Montesano de Talavera, Marisa – Asia-Pacific Forum on Science Learning and Teaching, 2016

This study applies a new quantitative methodological approach to diagnose epistemology conceptions in a large sample. The analyses use seven multiple-rating items on the epistemology of science drawn from the item pool Views on Science-Technology-Society (VOSTS). The bases of the new methodological diagnostic approach are the empirical…

Descriptors: Epistemology, Statistical Analysis, Science and Society, Scientific Principles

Cluster Analysis of Junior High School Students' Cognitive Structures

Peer reviewed

Direct link

Dan, Youngjun; Geng, Leisha; Li, Meng – Education, 2017

This study aimed to explore students' cognitive patterns based on their knowledge and levels. Participants were seventh graders from a junior high school in China. Three relatively distinct groups were specified by Cluster Analysis: high knowledge and low ability, low knowledge and low ability, and high knowledge and high ability. The group of low…

Descriptors: Cognitive Structures, Curriculum Design, Teaching Methods, Junior High School Students

Increasing the Validity of Angoff Standards through Analysis of Judge-Level Internal Consistency

Peer reviewed

Direct link

Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014

The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…

Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Applied Measurement in…	3
International Journal of…	3
ProQuest LLC	3
Applied Linguistics	2
Creativity Research Journal	2
ETS Research Report Series	2
Online Submission	2
Perceptual and Motor Skills	2
Advances in Health Sciences…	1
Advances in Physiology…	1
Applied Psychological…	1
Asia-Pacific Forum on Science…	1
Assessment & Evaluation in…	1
Australian Educational…	1
CALICO Journal	1
Education	1
Educational Psychology in…	1
Educational and Psychological…	1
English Language Teaching	1
International Journal of…	1
Journal of College Teaching &…	1
Journal of Education and…	1
Journal of Educational and…	1
Journal of Language and…	1
Journal of Speech, Language,…	1
More ▼

Attali, Yigal	2
Clauser, Brian E.	2
Simper, Natalie	2
Abdul Gafoor, K.	1
Akkoyunlu, Buket	1
Allan S. Cohen	1
Alsardary, Salar	1
Amanda Huee-Ping Wong	1
Andersson, Marie	1
Apple, Kristen	1
Baldwin, Peter	1
Beaty, Roger E.	1
Bell, Courtney A.	1
Bishop, Jesica M.	1
Blair, William O.	1
Blumberg, Phyllis	1
Bristow, Lora J.	1
Carifio, James	1
Casabianca, Jodi M.	1
Charlin, Bernard	1
Chen, Jin	1
Clauser, Brian	1
Clauser, Jerome C.	1
Conti, Gary J.	1
Cooner, Donna	1
More ▼