Publication Date
In 2025: 6
Since 2024: 18
Since 2021 (last 5 years): 99
Since 2016 (last 10 years): 243
Since 2006 (last 20 years): 511
Showing 1 to 15 of 511 results
Peer reviewed
Direct link
Tiffany Wu; Christina Weiland; Meghan McCormick; JoAnn Hsueh; Catherine Snow; Jason Sachs – Grantee Submission, 2024
The Hearts and Flowers (H&F) task is a computerized executive functioning (EF) assessment that has been used to measure EF from early childhood to adulthood. It provides data on accuracy and reaction time (RT) across three different task blocks (hearts, flowers, and mixed). However, there is a lack of consensus in the field on how to score the…
Descriptors: Scoring, Executive Function, Kindergarten, Young Children
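The abstract above notes that the Hearts and Flowers task yields accuracy and reaction time (RT) across three blocks. As a minimal illustrative sketch (not the authors' scoring code; trial data, field names, and the choice to average RT over correct trials only are assumptions), per-block summaries might be computed like this:

```python
# Illustrative sketch: summarizing Hearts and Flowers trial records
# into per-block accuracy and mean reaction time (correct trials only).
from collections import defaultdict
from statistics import mean

def summarize_hf(trials):
    """trials: list of (block, correct: bool, rt_ms: float) tuples."""
    by_block = defaultdict(list)
    for block, correct, rt in trials:
        by_block[block].append((correct, rt))
    summary = {}
    for block, recs in by_block.items():
        accuracy = sum(c for c, _ in recs) / len(recs)
        correct_rts = [rt for c, rt in recs if c]
        summary[block] = {
            "accuracy": accuracy,
            "mean_rt": mean(correct_rts) if correct_rts else None,
        }
    return summary

trials = [("hearts", True, 420.0), ("hearts", True, 480.0),
          ("flowers", True, 610.0), ("flowers", False, 550.0),
          ("mixed", True, 700.0)]
print(summarize_hf(trials))
```

The lack of consensus the abstract mentions concerns exactly these choices: whether to combine accuracy and RT, and which blocks and trials to include.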
Peer reviewed
PDF on ERIC: Download full text
Güntay Tasçi – Science Insights Education Frontiers, 2024
The present study has aimed to develop and validate a protein concept inventory (PCI) consisting of 25 multiple-choice (MC) questions to assess students' understanding of protein, which is a fundamental concept across different biology disciplines. The development process of the PCI involved a literature review to identify protein-related content,…
Descriptors: Science Instruction, Science Tests, Multiple Choice Tests, Biology
Peer reviewed
Direct link
Ferrara, Steve; Qunbar, Saed – Journal of Educational Measurement, 2022
In this article, we argue that automated scoring engines should be transparent and construct relevant--that is, as much as is currently feasible. Many current automated scoring engines cannot achieve high degrees of scoring accuracy without allowing in some features that may not be easily explained and understood and may not be obviously and…
Descriptors: Artificial Intelligence, Scoring, Essays, Automation
Peer reviewed
Direct link
Matt Homer – Advances in Health Sciences Education, 2024
Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…
Descriptors: Examiners, Scoring, Validity, Cutting Scores
Peer reviewed
Direct link
Plucker, Jonathan A. – Creativity Research Journal, 2023
In 1998, Plucker and Runco provided an overview of creativity assessment, noting current issues (fluency confounds, generality vs. specificity), recent advances (predictive validity, implicit theories), and promising future directions (moving beyond divergent thinking measures, reliance on batteries of assessments, translation into practice). In…
Descriptors: Creativity, Creativity Tests, Creative Thinking, Semantics
Peer reviewed
Direct link
Michael D. Wray; Matthew R. Reynolds – Journal of Psychoeducational Assessment, 2025
The KeyMath-3 Diagnostic Assessment (KM-3) is an individually-administered math assessment used in educational placement and diagnostic decisions. It includes 10 subtests making up Basic Concepts, Operations, and Applications indexes and a "Total Test" composite that measures overall math ability. Here, covariances among subtests from…
Descriptors: Diagnostic Tests, Mathematics Tests, Arithmetic, Factor Analysis
Peer reviewed
PDF on ERIC: Download full text
Er, Zübeyde; Dinç Artut, Perihan; Bal, Ayten Pinar – Pegem Journal of Education and Instruction, 2023
The aim of this research is to develop a reliable and valid scale to determine the mathematical thinking skills of gifted students. In addition, with the developed scale, the thinking skills of gifted students were examined in terms of various variables. In this context, the research was carried out on two different study groups. The first stage of…
Descriptors: Measures (Individuals), Rating Scales, Test Construction, Construct Validity
Peer reviewed
Direct link
Beisemann, Marie; Forthmann, Boris; Bürkner, Paul-Christian; Holling, Heinz – Journal of Creative Behavior, 2020
The Remote Associates Test (RAT; Mednick, 1962; Mednick & Mednick, 1967) is a commonly employed test of creative convergent thinking. The RAT is scored dichotomously: correct answers score 1 and all other answers score 0. Based on recent research into the information processing underlying RAT performance, we argued that the…
Descriptors: Psychometrics, Scoring, Tests, Semantics
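The dichotomous scoring rule the abstract describes is simple to state precisely. A minimal sketch (the item wording and the case/whitespace normalization are assumptions, not part of the published scoring key):

```python
# Dichotomous RAT scoring as described above: the keyed answer
# scores 1, every other response scores 0.
def score_rat_item(response: str, keyed_answer: str) -> int:
    return 1 if response.strip().lower() == keyed_answer.strip().lower() else 0

# e.g., for the classic cue triad "sour / dream / ice", the keyed answer is "cream"
print(score_rat_item("cream", "cream"))   # 1
print(score_rat_item("cheese", "cream"))  # 0
```

The article's argument proceeds from the limitation visible here: all incorrect responses collapse to 0, discarding information about how semantically close a response was to the keyed answer.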
Peer reviewed
Direct link
Wise, Steven; Kuhfeld, Megan – Applied Measurement in Education, 2021
Effort-moderated (E-M) scoring is intended to estimate how well a disengaged test taker would have performed had they been fully engaged. It accomplishes this adjustment by excluding disengaged responses from scoring and estimating performance from the remaining responses. The scoring method, however, assumes that the remaining responses are not…
Descriptors: Scoring, Achievement Tests, Identification, Validity
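The adjustment the abstract describes — drop disengaged responses, then estimate performance from what remains — can be sketched in a few lines. This is an illustrative simplification, not Wise and Kuhfeld's implementation: the response-time threshold, the tuple format, and the use of proportion-correct rather than an IRT-based estimate are all assumptions.

```python
# Sketch of effort-moderated (E-M) scoring: responses faster than an
# assumed response-time threshold are treated as disengaged and
# excluded; the score is estimated from the remaining responses.
def effort_moderated_score(responses, rt_threshold=3.0):
    """responses: list of (correct: bool, response_time_seconds: float)."""
    engaged = [(c, rt) for c, rt in responses if rt >= rt_threshold]
    if not engaged:
        return None  # no engaged responses left to score
    return sum(c for c, _ in engaged) / len(engaged)

responses = [(True, 12.4), (False, 0.8), (False, 9.0), (True, 15.0)]
print(effort_moderated_score(responses))  # scores only the 3 engaged responses
```

The assumption the article probes is visible in the last line: the method treats the surviving (engaged) responses as representative of what the test taker could do, which may not hold.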
Peer reviewed
Direct link
Aloisi, Cesare – European Journal of Education, 2023
This article considers the challenges of using artificial intelligence (AI) and machine learning (ML) to assist high-stakes standardised assessment. It focuses on the detrimental effect that even state-of-the-art AI and ML systems could have on the validity of national exams of secondary education, and how lower validity would negatively affect…
Descriptors: Standardized Tests, Test Validity, Credibility, Algorithms
Selcuk Acar; Kelly Berthiaume; Katalin Grajzel; Denis Dumas; Charles Flemister; Peter Organisciak – Gifted Child Quarterly, 2023
In this study, we applied different text-mining methods to the originality scoring of the Unusual Uses Test (UUT) and Just Suppose Test (JST) from the Torrance Tests of Creative Thinking (TTCT)--Verbal. Responses from 102 and 123 participants who completed Form A and Form B, respectively, were scored using three different text-mining methods. The…
Descriptors: Creative Thinking, Creativity Tests, Scoring, Automation
Peer reviewed
Direct link
Shermis, Mark D. – Journal of Educational Measurement, 2022
One of the challenges of discussing validity arguments for machine scoring of essays centers on the absence of a commonly held definition and theory of good writing. At best, the algorithms attempt to measure select attributes of writing and calibrate them against human ratings with the goal of accurate prediction of scores for new essays.…
Descriptors: Scoring, Essays, Validity, Writing Evaluation
Peer reviewed
Direct link
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Peer reviewed
PDF on ERIC: Download full text
Deborah Oluwadele; Yashik Singh; Timothy Adeliyi – Electronic Journal of e-Learning, 2024
Validation is needed for any newly developed model or framework, and it requires several real-life applications. The investment made into e-learning in medical education is daunting, as is the expectation for a positive return on investment. The medical education domain requires data-wise implementation of e-learning as the debate continues…
Descriptors: Electronic Learning, Evaluation Methods, Medical Education, Sustainability
Peer reviewed
Direct link
Huawei, Shi; Aryadoust, Vahid – Education and Information Technologies, 2023
Automated writing evaluation (AWE) systems are developed based on interdisciplinary research and technological advances such as natural language processing, computer sciences, and latent semantic analysis. Despite a steady increase in research publications in this area, the results of AWE investigations are often mixed, and their validity may be…
Descriptors: Writing Evaluation, Writing Tests, Computer Assisted Testing, Automation