ERIC - Search Results

Publication Date

In 2025	3
Since 2024	5
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	23

Publication Type

Journal Articles	21
Reports - Research	21
Reports - Descriptive	3
Tests/Questionnaires	3
Numerical/Quantitative Data	2
Reports - Evaluative	2
Collected Works - Serial	1
Guides - General	1

Education Level

Higher Education	26
Postsecondary Education	26
Secondary Education	3
Junior High Schools	2
Middle Schools	2
Elementary Education	1
Grade 8	1
High Schools	1

Audience

Location

Australia	2
Canada	2
China	2
Turkey	2
China (Beijing)	1
Philippines	1
Portugal	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	2
SAT (College Admission Test)	2
Depression Anxiety and Stress…	1
Flesch Kincaid Grade Level…	1
International English…	1
Marlowe Crowne Social…	1
National Merit Scholarship…	1
Praxis Series	1
Preliminary Scholastic…	1
Rosenberg Self Esteem Scale	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Neglected, Acknowledged, or Targeted: A Conceptual Framing of Variability, Data Analysis, and Domain Consequences

Peer reviewed

Direct link

Zachary del Rosario – Journal of Statistics and Data Science Education, 2024

Variability is underemphasized in domains such as engineering. Statistics and data science education research offers a variety of frameworks for understanding variability, but new frameworks for domain applications are necessary. This study investigated the professional practices of working engineers to develop such a framework. The Neglected,…

Descriptors: Foreign Countries, Engineering Education, Engineering, Technical Occupations

Psychometric Properties of the Depression, Anxiety, and Stress Scale-21 (DASS-21) across Nine Countries/Regions

Peer reviewed

Direct link

Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025

Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…

Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context

Psychometric Evaluation of Perceived Internship PUA Scale: Using Rasch Analysis

Peer reviewed

Direct link

Yanchao Yang; Wangze Li; Sijia Xue; Wenxue Huang; Shijie Guo – European Journal of Education, 2025

In response to the prevalence of perceived internship Pick-up Artist(PUA) behaviours and the lack of appropriate measurement tools, the purpose of this study was to develop and validate a new self-designed questionnaire, the Perceived Internship PUA Scale (PIPUAS), to assess college student interns' perceptions of internship PUA behaviours. The…

Descriptors: Measurement Techniques, Incidence, Internship Programs, Validity

Comparing Numerical Methods to Estimate Vertical Jump Height Using a Force Platform

Peer reviewed

Direct link

Chiu, Loren Z. F.; Daehlin, Torstein E. – Measurement in Physical Education and Exercise Science, 2020

Males (n = 29) and females (n = 34) performed vertical jumps. Jump height was estimated from force platform data using five numerical methods and compared using intraclass correlation ([rho]), and linear and rank regression standard error of estimate ("SEE"). Take-off velocity plus center of mass height at take-off and mechanical work…

Descriptors: Physical Activities, Scientific Concepts, Computation, Motion

The Reliability of Simultaneous versus Individual Data Collection during Stuttering Assessment

Peer reviewed

Direct link

Davidow, Jason H.; Ye, Jun; Edge, Robin L. – International Journal of Language & Communication Disorders, 2023

Background: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. Aims: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually.…

Descriptors: Graduate Students, Measurement, Reliability, Group Activities

Unbiased, Reliable, and Valid Student Evaluations Can Still Be Unfair

Peer reviewed

Direct link

Esarey, Justin; Valdes, Natalie – Assessment & Evaluation in Higher Education, 2020

Scholarly debate about student evaluations of teaching (SETs) often focuses on whether SETs are valid, reliable and unbiased. In this article, we assume the most optimistic conditions for SETs that are supported by the empirical literature. Specifically, we assume that SETs are moderately correlated with teaching quality (student learning and…

Descriptors: Student Evaluation of Teacher Performance, Bias, Reliability, Validity

Brief Conscientiousness Scales: How Low Can You Go? ACT Research. Issue Brief. R2407

Download full text

Kate E. Walton – ACT, Inc., 2024

There is a tradeoff between scale length and psychometric concerns. The two are, in fact, directly linked. Generally, when scales are shortened, reliability is reduced, and when scales are lengthened, reliability is improved, provided the items added to the scale are comparable psychometrically (AERA et al., 2014). Scale reliability, in turn,…

Descriptors: Psychometrics, Error of Measurement, Rating Scales, Reliability

Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill

Peer reviewed
PDF on ERIC

Download full text

Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…

Descriptors: Evaluators, Training, Comparative Analysis, Academic Language

Exploring Cross-Cultural and Gender Differences in Test Anxiety among U.S. and Canadian College Students

Peer reviewed

Direct link

Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2019

Existing measures of test anxiety used with the college student population are old with old norms and old items, and they do not capture the multiple dimensions of the test anxiety construct or assess facilitating anxiety. In the present study, the validity of the scores of a new, multidimensional measure of test anxiety with a facilitating…

Descriptors: Cross Cultural Studies, Gender Differences, Test Anxiety, Foreign Countries

Simple and Low-Cost Setup for Measurement of the Density of a Liquid

Peer reviewed

Direct link

Noei, Nima; Imani, Iman Mohammadi; Wilson, Lee D.; Azizian, Saeid – Journal of Chemical Education, 2019

A low-cost and simple setup to measure the densities of liquids is introduced herein. The results and reliability of this setup were evaluated for pure liquids, water-ethanol binary mixtures, and aqueous NaCl solutions. The constructed densitometer provided density values with acceptable relative errors (less than ±3.0%), which were compared to…

Descriptors: Chemistry, Science Education, Science Instruction, Laboratory Experiments

Validation of the Long- and Short-Form of the Ethical Values Assessment (EVA): A Questionnaire Measuring the Three Ethics Approach to Moral Psychology

Peer reviewed

Direct link

Padilla-Walker, Laura Maria; Jensen, Lene Arnett – International Journal of Behavioral Development, 2016

Moral psychology has been moving toward consideration of multiple kinds of moral concepts and values, such as the Ethics of Autonomy, Community, and Divinity. While these three ethics have commonly been measured qualitatively, the current study sought to validate the long and short forms of the Ethical Values Assessment (EVA), which is a…

Descriptors: Ethics, Questionnaires, Error of Measurement, Moral Values

The "Don't Know" Option in Progress Testing

Peer reviewed

Direct link

Ravesloot, C. J.; Van der Schaaf, M. F.; Muijtjens, A. M. M.; Haaring, C.; Kruitwagen, C. L. J. J.; Beek, F. J. A.; Bakker, J.; Van Schaik, J.P.J.; Ten Cate, Th. J. – Advances in Health Sciences Education, 2015

Formula scoring (FS) is the use of a don't know option (DKO) with subtraction of points for wrong answers. Its effect on construct validity and reliability of progress test scores, is subject of discussion. Choosing a DKO may not only be affected by knowledge level, but also by risk taking tendency, and may thus introduce construct-irrelevant…

Descriptors: Scoring Formulas, Tests, Scores, Construct Validity

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

Peer reviewed

Direct link

Han, Chao – Language Assessment Quarterly, 2016

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

Descriptors: Foreign Countries, Scores, English, Chinese

Previous Page | Next Page »

Pages: 1 | 2

ACT, Inc.	2
Advances in Health Sciences…	2
Assessment & Evaluation in…	2
ETS Research Report Series	2
Association for Institutional…	1
British Educational Research…	1
CALICO Journal	1
College Board	1
Educational Psychology	1
European Journal of Education	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Journal of Statistics and…	1
Language Assessment Quarterly	1
Measurement in Physical…	1
Psychological Methods	1
Society for Research on…	1
More ▼

Error of Measurement	26
Reliability	26
Scores	13
Foreign Countries	10
Validity	9
College Students	8
Undergraduate Students	6
Psychometrics	5
College Entrance Examinations	4
Comparative Analysis	4
Computation	4
Measures (Individuals)	4
Statistical Analysis	4
Computer Assisted Testing	3
Correlation	3
Data Collection	3
English (Second Language)	3
Evaluators	3
Gender Differences	3
Interrater Reliability	3
Measurement	3
Measurement Techniques	3
Rating Scales	3
Scoring	3
Second Language Learning	3
More ▼

Cook, Thomas D.	2
Haberman, Shelby J.	2
Shadish, William R.	2
Steiner, Peter M.	2
Aryadoust, Vahid	1
Azizian, Saeid	1
Bakker, J.	1
Becker, Gilbert	1
Beek, F. J. A.	1
Chiu, Loren Z. F.	1
Cristian Zanon	1
Daehlin, Torstein E.	1
Dalia Nasvytiene	1
David L. Vogel	1
Davidow, Jason H.	1
Dollman, James	1
Edge, Robin L.	1
Ertugrul Sahin	1
Esarey, Justin	1
Fatima Rashed Al-Darmaki	1
Ferrao, Maria	1
Georg Schomerus	1
Haaring, C.	1
Han, Chao	1
Harris, Deborah J.	1
More ▼