ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	12
Since 2016 (last 10 years)	27
Since 2006 (last 20 years)	57

Descriptor

Scoring	351
Test Interpretation	351
Test Reliability	99
Test Construction	89
Test Validity	83
Testing	81
Elementary Secondary Education	55
Test Use	53
Test Results	51
Testing Problems	45
Scores	42
Standardized Tests	42
Achievement Tests	39
Higher Education	39
Test Items	33
Testing Programs	33
Item Analysis	31
Educational Testing	28
Test Norms	28
Measurement Techniques	26
Educational Assessment	25
Evaluation Methods	25
Language Tests	25
Statistical Analysis	25
Student Evaluation	25

Education Level

Higher Education	12
Postsecondary Education	10
Elementary Secondary Education	8
Elementary Education	7
Secondary Education	5
Early Childhood Education	3
Kindergarten	3
Junior High Schools	2
Middle Schools	2
Grade 1	1
Grade 6	1
High Schools	1
Preschool Education	1
Primary Education	1

Audience

Practitioners	39
Teachers	17
Administrators	8
Researchers	6
Counselors	5
Parents	5
Policymakers	2
Students	1

Location

California	6
Australia	4
Canada	4
New York	3
United States	3
Kentucky	2
Pennsylvania	2
Rhode Island	2
Vermont	2
Arizona	1
Arkansas	1
Germany	1
Indonesia	1
Israel	1
Italy	1
Jordan	1
Kansas	1
Louisiana	1
Massachusetts	1
Michigan	1
New York (New York)	1
Northern Mariana Islands	1
Ohio	1
Oregon	1
Pennsylvania (Philadelphia)	1

Laws, Policies, & Programs

Elementary and Secondary…	2
Education Consolidation…	1
National Defense Education Act	1
No Child Left Behind Act 2001	1

Showing 1 to 15 of 351 results

Integration of Prediction Scores from Various Automated Essay Scoring Models Using Item Response Theory

Peer reviewed

Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023

In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…

Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring

Leveraging ChatGPT for Scoring Students' Subjective Tests

Peer reviewed

Tri Sedya Febrianti; Siti Fatimah; Yuni Fitriyah; Hanifah Nurhayati – International Journal of Education in Mathematics, Science and Technology, 2024

Assessing students' understanding of circle-related material through subjective tests is effective, though grading these tests can be challenging and often requires technological support. ChatGPT has shown promise in providing reliable and objective evaluations. Many teachers in Indonesia, however, continue to face difficulties integrating…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Scoring, Tests

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Examining the Effects of a Real-Time, Knowledge-Aware Tool for Academic Writing Assessment

Peer reviewed

Li, Xu; Ouyang, Fan; Liu, Jianwen; Wei, Chengkun; Chen, Wenzhi – Journal of Educational Computing Research, 2023

The computer-supported writing assessment (CSWA) has been widely used to reduce instructor workload and provide real-time feedback. Interpretability of CSWA draws extensive attention because it can benefit the validity, transparency, and knowledge-aware feedback of academic writing assessments. This study proposes a novel assessment tool,…

Descriptors: Computer Assisted Testing, Writing Evaluation, Feedback (Response), Natural Language Processing

Operationalizing the Reading-into-Writing Construct in Analytic Rating Scales: Effects of Different Approaches on Rating

Peer reviewed

Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023

Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…

Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes

Comparison of Two Approaches to Interpretive Use Arguments

Peer reviewed

Carney, Michele; Crawford, Angela; Siebert, Carl; Osguthorpe, Rich; Thiede, Keith – Applied Measurement in Education, 2019

The "Standards for Educational and Psychological Testing" recommend an argument-based approach to validation that involves a clear statement of the intended interpretation and use of test scores, the identification of the underlying assumptions and inferences in that statement--termed the interpretation/use argument, and gathering of…

Descriptors: Inquiry, Test Interpretation, Validity, Scores

Digital-First Assessments: A Security Framework

Peer reviewed

LaFlair, Geoffrey T.; Langenfeld, Thomas; Baig, Basim; Horie, André Kenji; Attali, Yigal; von Davier, Alina A. – Journal of Computer Assisted Learning, 2022

Background: Digital-first assessments leverage the affordances of technology in all elements of the assessment process--from design and development to score reporting and evaluation to create test taker-centric assessments. Objectives: The goal of this paper is to describe the engineering, machine learning, and psychometric processes and…

Descriptors: Computer Assisted Testing, Affordances, Scoring, Engineering

Q-Interactive: Training Implications for Accuracy and Technology Integration

Peer reviewed

Corcoran, Stephanie – Contemporary School Psychology, 2022

With the iPad-mediated cognitive assessment gaining popularity with school districts and the need for alternative modes for training and instruction during this COVID-19 pandemic, school psychology training programs will need to adapt to effectively train their students to be competent in administering, scoring, and interpreting cognitive…

Descriptors: School Psychologists, Professional Education, Job Skills, Cognitive Tests

A General Method for Adjusting Test Score Distributions to Account for Rescoring and Retesting

Peer reviewed

Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021

Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regent test score…

Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas

On the Complementarity of Holistic and Analytic Approaches to Performance Assessment Scoring

Peer reviewed

Zlatkin-Troitschanskaia, Olga; Shavelson, Richard J.; Schmidt, Susanne; Beck, Klaus – British Journal of Educational Psychology, 2019

Background: A holistic approach to performance assessment recognizes the theoretical complexity of multifaceted critical thinking (CT), a key objective of higher education. However, issues related to reliability, interpretation, and use arise with this approach. Aims and Method: Therefore, we take an analytic approach to scoring students' written…

Descriptors: Holistic Approach, Performance Based Assessment, Critical Thinking, College Students

A Validation Framework for Science Learning Progression Research

Peer reviewed

Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – International Journal of Science Education, 2019

This article provides a validation framework for research on the development and use of science Learning Progressions (LPs). The framework describes how evidence from various sources can be used to establish an interpretive argument and a validity argument at five stages of LP research--development, scoring, generalisation, extrapolation, and use.…

Descriptors: Sequential Approach, Educational Research, Science Education, Validity

A Validation Framework for Science Learning Progression Research

Peer reviewed

Jin, Hui; van Rijn, Peter; Moore, John C.; Bauer, Malcolm I.; Pressler, Yamina; Yestness, Nissa – Grantee Submission, 2019

Descriptors: Sequential Approach, Educational Research, Science Education, Validity

Integrating Validation Arguments with the Assessment Triangle: A Framework for Operationalizing and Instantiating Validation

Peer reviewed

Ketterlin-Geller, Leanne R.; Perry, Lindsey; Adams, Elizabeth – Applied Measurement in Education, 2019

Despite the call for an argument-based approach to validity over 25 years ago, few examples exist in the published literature. One possible explanation for this outcome is that the complexity of the argument-based approach makes implementation difficult. To counter this claim, we propose that the Assessment Triangle can serve as the overarching…

Descriptors: Validity, Educational Assessment, Models, Screening Tests

Validity of Automated Learning Progress Assessment in English Written Expression for Students with Learning Difficulties

Peer reviewed

Sterett H. Mercer; Joanna E. Cannon – Grantee Submission, 2022

We evaluated the validity of an automated approach to learning progress assessment (aLPA) for English written expression. Participants (n = 105) were students in Grades 2-12 who had parent-identified learning difficulties and received academic tutoring through a community-based organization. Participants completed narrative writing samples in the…

Descriptors: Elementary School Students, Secondary School Students, Learning Problems, Learning Disabilities

Source

Educational Measurement:…	15
Journal of Psychoeducational…	12
Educational and Psychological…	9
Journal of Educational…	9
Psychology in the Schools	6
Grantee Submission	5
Applied Measurement in…	4
Perceptual and Motor Skills	4
American Psychologist	3
Assessment	3
Journal of Counseling…	3
Language Testing	3
Arithmetic Teacher	2
British Journal of…	2
Canadian Journal of School…	2
Educ Psychol Meas	2
Educational Assessment	2
International Journal of…	2
J Educ Meas	2
Journal of Consulting and…	2
Academic Therapy	1
American Journal of Mental…	1
American Language Review	1
Association for Supervision…	1
Audio-Visual Language Journal	1

Author

White, Edward M.	6
Echternacht, Gary	3
Plake, Barbara S.	3
Bauer, Malcolm I.	2
Bowles, Ryan P.	2
De Avila, Edward A.	2
Dorans, Neil J.	2
Duncan, Sharon E.	2
Edgington, Eugene S.	2
Elliott-Schuman, Nikki	2
Flanagan, Dawn P.	2
Hall, Gene E.	2
Haller, Otto	2
Horkay, Nancy, Ed.	2
Jensrud, Qetler	2
Jin, Hui	2
Kaufman, Alan S.	2
Livingston, Samuel A.	2
Mascolo, Jennifer T.	2
Moore, John C.	2
Moss, Jerome, Jr.	2
Pressler, Yamina	2
Reynolds, Cecil R.	2
Rippey, Robert M.	2

Publication Type

Journal Articles	109
Reports - Research	92
Guides - Non-Classroom	61
Reports - Evaluative	59
Tests/Questionnaires	41
Reports - Descriptive	34
Speeches/Meeting Papers	32
Opinion Papers	12
Books	11
Guides - General	9
Information Analyses	7
Numerical/Quantitative Data	7
Book/Product Reviews	4
ERIC Digests in Full Text	3
ERIC Publications	3
Guides - Classroom - Teacher	3
Reports - General	3
Collected Works - Proceedings	2
Legal/Legislative/Regulatory…	2
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Historical Materials	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

National Assessment of…	9
Wechsler Intelligence Scale…	8
ACT Assessment	3
General Aptitude Test Battery	3
Graduate Record Examinations	3
Massachusetts Comprehensive…	3
Washington Assessment of…	3
Woodcock Johnson Tests of…	3
Bender Gestalt Test	2
Goodenough Harris Drawing Test	2
Iowa Tests of Basic Skills	2
McCarthy Scales of Childrens…	2
Medical College Admission Test	2
Minnesota Multiphasic…	2
Rod and Frame Test	2
Rorschach Test	2
SAT (College Admission Test)	2
Stages of Concern…	2
Stanford Binet Intelligence…	2
Strong Campbell Interest…	2
Test of English as a Foreign…	2
Wechsler Adult Intelligence…	2
ACT Interest Inventory	1
Adaptive Behavior Scale	1
Advanced Placement…	1