NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 26 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Jonas Flodén – British Educational Research Journal, 2025
This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…
Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Zachary del Rosario – Journal of Statistics and Data Science Education, 2024
Variability is underemphasized in domains such as engineering. Statistics and data science education research offers a variety of frameworks for understanding variability, but new frameworks for domain applications are necessary. This study investigated the professional practices of working engineers to develop such a framework. The Neglected,…
Descriptors: Foreign Countries, Engineering Education, Engineering, Technical Occupations
Peer reviewed Peer reviewed
Direct linkDirect link
Cristian Zanon; Nan Zhao; Nursel Topkaya; Ertugrul Sahin; David L. Vogel; Melissa M. Ertl; Samineh Sanatkar; Hsin-Ya Liao; Mark Rubin; Makilim N. Baptista; Winnie W. S. Mak; Fatima Rashed Al-Darmaki; Georg Schomerus; Ying-Fen Wang; Dalia Nasvytiene – International Journal of Testing, 2025
Examinations of the internal structure of the Depression, Anxiety, and Stress Scale-21 (DASS-21) have yielded inconsistent conclusions within and across cultural contexts. This study examined the dimensionality and reliability of the DASS-21 across three theoretically plausible factor structures (i.e., unidimensional, oblique three-factor, and…
Descriptors: Anxiety, Depression (Psychology), Psychometrics, Cultural Context
Peer reviewed Peer reviewed
Direct linkDirect link
Yanchao Yang; Wangze Li; Sijia Xue; Wenxue Huang; Shijie Guo – European Journal of Education, 2025
In response to the prevalence of perceived internship Pick-up Artist(PUA) behaviours and the lack of appropriate measurement tools, the purpose of this study was to develop and validate a new self-designed questionnaire, the Perceived Internship PUA Scale (PIPUAS), to assess college student interns' perceptions of internship PUA behaviours. The…
Descriptors: Measurement Techniques, Incidence, Internship Programs, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Chiu, Loren Z. F.; Daehlin, Torstein E. – Measurement in Physical Education and Exercise Science, 2020
Males (n = 29) and females (n = 34) performed vertical jumps. Jump height was estimated from force platform data using five numerical methods and compared using intraclass correlation ([rho]), and linear and rank regression standard error of estimate ("SEE"). Take-off velocity plus center of mass height at take-off and mechanical work…
Descriptors: Physical Activities, Scientific Concepts, Computation, Motion
Peer reviewed Peer reviewed
Direct linkDirect link
Davidow, Jason H.; Ye, Jun; Edge, Robin L. – International Journal of Language & Communication Disorders, 2023
Background: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. Aims: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually.…
Descriptors: Graduate Students, Measurement, Reliability, Group Activities
Peer reviewed Peer reviewed
Direct linkDirect link
Esarey, Justin; Valdes, Natalie – Assessment & Evaluation in Higher Education, 2020
Scholarly debate about student evaluations of teaching (SETs) often focuses on whether SETs are valid, reliable and unbiased. In this article, we assume the most optimistic conditions for SETs that are supported by the empirical literature. Specifically, we assume that SETs are moderately correlated with teaching quality (student learning and…
Descriptors: Student Evaluation of Teacher Performance, Bias, Reliability, Validity
Kate E. Walton – ACT, Inc., 2024
There is a tradeoff between scale length and psychometric concerns. The two are, in fact, directly linked. Generally, when scales are shortened, reliability is reduced, and when scales are lengthened, reliability is improved, provided the items added to the scale are comparable psychometrically (AERA et al., 2014). Scale reliability, in turn,…
Descriptors: Psychometrics, Error of Measurement, Rating Scales, Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…
Descriptors: Evaluators, Training, Comparative Analysis, Academic Language
Peer reviewed Peer reviewed
Direct linkDirect link
Lowe, Patricia A. – Journal of Psychoeducational Assessment, 2019
Existing measures of test anxiety used with the college student population are old with old norms and old items, and they do not capture the multiple dimensions of the test anxiety construct or assess facilitating anxiety. In the present study, the validity of the scores of a new, multidimensional measure of test anxiety with a facilitating…
Descriptors: Cross Cultural Studies, Gender Differences, Test Anxiety, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Noei, Nima; Imani, Iman Mohammadi; Wilson, Lee D.; Azizian, Saeid – Journal of Chemical Education, 2019
A low-cost and simple setup to measure the densities of liquids is introduced herein. The results and reliability of this setup were evaluated for pure liquids, water-ethanol binary mixtures, and aqueous NaCl solutions. The constructed densitometer provided density values with acceptable relative errors (less than ±3.0%), which were compared to…
Descriptors: Chemistry, Science Education, Science Instruction, Laboratory Experiments
Peer reviewed Peer reviewed
Direct linkDirect link
Padilla-Walker, Laura Maria; Jensen, Lene Arnett – International Journal of Behavioral Development, 2016
Moral psychology has been moving toward consideration of multiple kinds of moral concepts and values, such as the Ethics of Autonomy, Community, and Divinity. While these three ethics have commonly been measured qualitatively, the current study sought to validate the long and short forms of the Ethical Values Assessment (EVA), which is a…
Descriptors: Ethics, Questionnaires, Error of Measurement, Moral Values
Peer reviewed Peer reviewed
Direct linkDirect link
Ravesloot, C. J.; Van der Schaaf, M. F.; Muijtjens, A. M. M.; Haaring, C.; Kruitwagen, C. L. J. J.; Beek, F. J. A.; Bakker, J.; Van Schaik, J.P.J.; Ten Cate, Th. J. – Advances in Health Sciences Education, 2015
Formula scoring (FS) is the use of a don't know option (DKO) with subtraction of points for wrong answers. Its effect on construct validity and reliability of progress test scores, is subject of discussion. Choosing a DKO may not only be affected by knowledge level, but also by risk taking tendency, and may thus introduce construct-irrelevant…
Descriptors: Scoring Formulas, Tests, Scores, Construct Validity
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Han, Chao – Language Assessment Quarterly, 2016
As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Descriptors: Foreign Countries, Scores, English, Chinese
Previous Page | Next Page »
Pages: 1  |  2