Showing 1 to 15 of 20 results
Peer reviewed
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams relative to human teachers. Aspects investigated include consistency, large discrepancies, and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Aaron McVay – ProQuest LLC, 2021
As assessments move towards computerized testing and continuous testing becomes available, the need for rapid assembly of test forms is increasing. The objective of this study was to investigate variability in assembled forms through the lens of the first- and second-order equity properties of equating, by examining three factors and their interactions. Two…
Descriptors: Automation, Computer Assisted Testing, Test Items, Reaction Time
Peer reviewed
Chen, Dandan; Hebert, Michael; Wilson, Joshua – American Educational Research Journal, 2022
We used multivariate generalizability theory to examine the reliability of hand-scoring and automated essay scoring (AES) and to identify how these scoring methods could be used in conjunction to optimize writing assessment. Students (n = 113) included subsamples of struggling writers and non-struggling writers in Grades 3-5 drawn from a larger…
Descriptors: Reliability, Scoring, Essays, Automation
Peer reviewed
Kershree Padayachee; M. Matimolane – Teaching in Higher Education, 2025
In the shift to Emergency Remote Teaching and Learning (ERT&L) during the COVID-19 pandemic, remote assessment and feedback became a major source of discontent and challenge for students and staff. This paper is a reflection and analysis of assessment practices during ERT&L, and our theorisation of the possibilities for shifts towards…
Descriptors: Educational Quality, Social Justice, Distance Education, Feedback (Response)
Peer reviewed
Lam, Elizabeth A.; Rose, Susan; McMaster, Kristen L. – Journal of Deaf Studies and Deaf Education, 2020
This study compared the reliability and validity of student scores from paper-pencil and e-based assessments using the "maze" and "silent reading fluency" (SRF) tasks. Forty students who were deaf and hard of hearing and reading between the second and fifth grade levels, and their teachers (n = 21), participated. For…
Descriptors: Deafness, Hearing Impairments, Curriculum Based Assessment, Evaluation Methods
Peer reviewed
Dalton, Sarah Grace; Stark, Brielle C.; Fromm, Davida; Apple, Kristen; MacWhinney, Brian; Rensch, Amanda; Rowedder, Madyson – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The aim of this study was to advance the use of structured, monologic discourse analysis by validating an automated scoring procedure for core lexicon (CoreLex) using transcripts. Method: Forty-nine transcripts from persons with aphasia and 48 transcripts from persons with no brain injury were retrieved from the AphasiaBank database. Five…
Descriptors: Validity, Discourse Analysis, Databases, Scoring
Peer reviewed
Allehaiby, Wid Hasen; Al-Bahlani, Sara – Arab World English Journal, 2021
One of the main challenges higher education institutions encounter amid the recent COVID-19 crisis is transferring assessment approaches from the traditional face-to-face form to the online Emergency Remote Teaching approach. A set of language assessment principles (practicality, reliability, validity, authenticity, and washback), which can be…
Descriptors: Barriers, Distance Education, Evaluation Methods, Teaching Methods
Salem, Walaa Abdel Fattah – Online Submission, 2021
The study aimed to examine the effect of a program based on a cloud-computing platform on developing assessment strategies for EFL primary school teachers. The study reviewed the literature and previous studies dealing with assessment strategies (self-assessment, peer-assessment, strategic use of questioning, and reflection), professional learning…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Elementary School Teachers
Peer reviewed
Darling-Hammond, Linda – Learning Policy Institute, 2017
After passage of the Every Student Succeeds Act (ESSA) in 2015, states assumed greater responsibility for designing their own accountability and assessment systems. ESSA requires states to measure "higher order thinking skills and understanding" and encourages the use of open-ended performance assessments, which are essential for…
Descriptors: Performance Based Assessment, Accountability, Portfolios (Background Materials), Task Analysis
Peer reviewed
Ghilay, Yaron; Ghilay, Ruth – Journal of Educational Technology, 2012
The study examined the advantages and disadvantages of computerised assessment compared to traditional evaluation. It was based on two samples of college students (n = 54) who took computerised tests instead of paper-based exams. Students were asked to answer a questionnaire focused on test effectiveness, experience, flexibility, and integrity…
Descriptors: Student Evaluation, Higher Education, Comparative Analysis, Computer Assisted Testing
Darling-Hammond, Linda – Council of Chief State School Officers, 2017
The Every Student Succeeds Act (ESSA) opened up new possibilities for how student and school success are defined and supported in American public education. States have greater responsibility for designing and building their assessment and accountability systems. These new opportunities to develop performance assessments are critically important…
Descriptors: Performance Based Assessment, Accountability, Portfolios (Background Materials), Task Analysis
Peer reviewed
Kettler, Ryan J.; Elliott, Stephen N.; Kurz, Alexander; Zigmond, Naomi; Lemons, Christopher J.; Kloo, Amanda; Shrago, Jacqueline; Beddow, Peter A.; Williams, Leila; Bruen, Charles; Lupp, Lynda; Farmer, Jeanie; Mosiman, Melanie – Assessment for Effective Intervention, 2014
Motivated by the multiple-measures clause of recent federal policy regarding student eligibility for alternate assessments based on modified academic achievement standards (AA-MASs), this study examined how scores or combinations of scores from a diverse set of assessments predicted students' end-of-year proficiency status on statewide achievement…
Descriptors: Eligibility, Alternative Assessment, Academic Achievement, Predictive Validity
Peer reviewed
Jordan, Nancy C.; Glutting, Joseph; Ramineni, Chaitanya; Watkins, Marley W. – School Psychology Review, 2010
Using a longitudinal design, children (N = 204) were given a brief number sense screener (NSB) at six time points, from the beginning of kindergarten to the middle of first grade. The NSB is based on research showing the importance of number competence (number, number relations, and number operations) for success in mathematics…
Descriptors: Mathematics Achievement, Reliability, Achievement Tests, Kindergarten
Peer reviewed
Chang, Shu-Ren; Plake, Barbara S.; Kramer, Gene A.; Lien, Shu-Mei – Educational and Psychological Measurement, 2011
This study examined the amount of time that examinees at different ability levels spend on questions they answer correctly or incorrectly across different pretest item blocks presented on a fixed-length, time-restricted computerized adaptive test (CAT). Results indicate that examinees at different ability levels require different amounts of time to…
Descriptors: Evidence, Test Items, Reaction Time, Adaptive Testing
Peer reviewed
Arce-Ferrer, Alvaro J.; Guzman, Elvira Martinez – Educational and Psychological Measurement, 2009
This study investigates the effect of the mode of administration of the Raven Standard Progressive Matrices test on the distribution, accuracy, and meaning of raw scores. A random sample of high school students took counterbalanced paper-and-pencil and computer-based administrations of the test and answered a questionnaire surveying preferences for…
Descriptors: Factor Analysis, Raw Scores, Statistical Analysis, Computer Assisted Testing