Showing 31 to 45 of 3,974 results
Peer reviewed
Tri Sedya Febrianti; Siti Fatimah; Yuni Fitriyah; Hanifah Nurhayati – International Journal of Education in Mathematics, Science and Technology, 2024
Assessing students' understanding of circle-related material through subjective tests is effective, though grading these tests can be challenging and often requires technological support. ChatGPT has shown promise in providing reliable and objective evaluations. Many teachers in Indonesia, however, continue to face difficulties integrating…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Scoring, Tests
Peer reviewed
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques
Peer reviewed
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Peer reviewed
Goldhammer, Frank; Hahnel, Carolin; Kroehne, Ulf; Zehner, Fabian – Large-scale Assessments in Education, 2021
International large-scale assessments such as PISA or PIAAC have started to provide public or scientific use files for log data; that is, events, event-related attributes and timestamps of test-takers' interactions with the assessment system. Log data and the process indicators derived from it can be used for many purposes. However, the intended…
Descriptors: International Assessment, Data, Computer Assisted Testing, Validity
Peer reviewed
Edward Karl Schultz; Emily Smith; Stephanie Zamora-Robles – Journal of the American Academy of Special Education Professionals, 2024
Evaluating students from culturally and linguistically diverse backgrounds (i.e., emergent bilinguals) presents challenges to evaluation teams, as distinguishing between a language disorder and typical second language development is more complex. The skills and knowledge required to do this task often exceed the level of training that evaluators…
Descriptors: Emergent Literacy, Bilingualism, Bilingual Students, Learning Disabilities
Peer reviewed
Colvin, Kimberly F.; Gorgun, Guher; Zhang, Sijun – Journal of Psychoeducational Assessment, 2020
The Rosenberg Self-Esteem Scale was administered with a 1-4, 1-5, or 0-100 scale to 819 participants, to compare score interpretations across the different versions. A rating scale utility analysis revealed that the categories in the 101-point scale were used inconsistently; based on the analysis, adjacent categories were collapsed resulting in a…
Descriptors: Self Concept Measures, Self Esteem, Test Interpretation, Scores
Peer reviewed
Edward J. Golob; Ricardo C. Olayo; Denver M. Y. Brown; Jeffrey R. Mock – Journal of Speech, Language, and Hearing Research, 2024
Purpose: Listening effort is a broad construct, and there is no consensus on how to subdivide listening effort into dimensions. This project focuses on the subjective experience of effortful listening and tests if cognitive workload, mental fatigue, and mood are interrelated dimensions. Method: Two online studies tested young adults (n = 74 and n…
Descriptors: Adults, Psychomotor Skills, Psychomotor Objectives, Listening Comprehension Tests
Peer reviewed
Viola Merhof; Caroline M. Böhm; Thorsten Meiser – Educational and Psychological Measurement, 2024
Item response tree (IRTree) models are a flexible framework to control self-reported trait measurements for response styles. To this end, IRTree models decompose the responses to rating items into sub-decisions, which are assumed to be made on the basis of either the trait being measured or a response style, whereby the effects of such person…
Descriptors: Item Response Theory, Test Interpretation, Test Reliability, Test Validity
Peer reviewed
Micir, Ian; Swygert, Kimberly; D'Angelo, Jean – Journal of Applied Testing Technology, 2022
The interpretations of test scores in secure, high-stakes environments are dependent on several assumptions, one of which is that examinee responses to items are independent and no enemy items are included on the same forms. This paper documents the development and implementation of a C#-based application that uses Natural Language Processing…
Descriptors: Artificial Intelligence, Man Machine Systems, Accuracy, Efficiency
Peer reviewed
Leventhal, Brian C.; Gregg, Nikole; Ames, Allison J. – Measurement: Interdisciplinary Research and Perspectives, 2022
Response styles introduce construct-irrelevant variance as a result of respondents systematically responding to Likert-type items regardless of content. Methods to account for response styles through data analysis as well as approaches to mitigating the effects of response styles during data collection have been well-documented. Recent approaches…
Descriptors: Response Style (Tests), Item Response Theory, Test Items, Likert Scales
Peer reviewed
Kranzler, John H.; Maki, Kathrin E.; Benson, Nicholas F.; Eckert, Tanya L.; Floyd, Randy G.; Fefer, Sarah A. – Contemporary School Psychology, 2020
Although intelligence tests are among the most widely used psychological instruments in school psychology, at the current time, little is known about how practitioners interpret them. The primary purpose of this study, therefore, was to determine how intelligence tests are interpreted by school psychologists, particularly for the identification of…
Descriptors: School Counselors, Test Interpretation, Intelligence Tests, Disability Identification
Peer reviewed
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Sascha Skucek – ProQuest LLC, 2022
When you look at an image, what do you see? What does the image say to you? What do you think about? What meaning do you infer? These questions may blur together, but they can be expanded individually and uniquely into a multitude of responses. Your initial thoughts are yours. You are silently debating meaning within yourself. If I interject a new…
Descriptors: Rhetoric, Listening, Freehand Drawing, Notetaking
Peer reviewed
Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025
Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…
Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment
Peer reviewed
Kho, Shermaine Qi En; Aryadoust, Vahid; Foo, Stacy – Education and Information Technologies, 2023
Studies have shown that test-takers tend to use keyword-matching strategies when taking listening tests. Keyword-matching involves matching content words in the written modality (test items) against those heard in the audio text. However, no research has investigated the effect of such keywords in listening tests, or the impact of gazing upon…
Descriptors: Eye Movements, Test Wiseness, Information Retrieval, Listening Comprehension Tests