Publication Date
In 2025 | 6 |
Since 2024 | 20 |
Since 2021 (last 5 years) | 115 |
Since 2016 (last 10 years) | 361 |
Since 2006 (last 20 years) | 758 |
Descriptor
Difficulty Level | 842 |
Scores | 842 |
Academic Achievement | 318 |
Higher Education | 295 |
Benchmarking | 264 |
Core Curriculum | 260 |
College Entrance Examinations | 244 |
Racial Differences | 220 |
High School Graduates | 215 |
Academic Aspiration | 213 |
Educational Trends | 213 |
Author
McNamara, Danielle S. | 5 |
Sheehan, Kathleen M. | 5 |
Guo, Hongwen | 4 |
Rock, Donald A. | 4 |
Bridgeman, Brent | 3 |
Bulut, Okan | 3 |
Johnson, Amy M. | 3 |
Kim, Sooyeon | 3 |
Likens, Aaron D. | 3 |
Liu, Ou Lydia | 3 |
McCarthy, Kathryn S. | 3 |
Audience
Policymakers | 117 |
Practitioners | 114 |
Researchers | 6 |
Parents | 1 |
Teachers | 1 |
Location
Turkey | 20 |
China | 13 |
United States | 13 |
Florida | 11 |
Iran | 10 |
South Korea | 10 |
California | 9 |
Canada | 9 |
Colorado | 9 |
Ohio | 9 |
Australia | 8 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Education Consolidation… | 1 |
Elementary and Secondary… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 2 |
Onur Dönmez; Yavuz Akbulut; Gözde Zabzun; Berrin Köseoglu – Applied Cognitive Psychology, 2025
This study investigates the effect of survey order in measuring self-reported cognitive load. Understanding how survey order influences responses is crucial, but it has been largely overlooked in the context of cognitive load. Using a 2 × 2 experimental design with 319 high school students, the study manipulated intrinsic cognitive load (ICL)…
Descriptors: Surveys, Test Construction, Measurement, Cognitive Processes
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Chen, Xuqian; Wei, Ziqian; Li, Ziteng; Clariana, Roy B. – Educational Technology Research and Development, 2023
How does conceptual structure of external representations contribute to learning? This investigation considered the influence of generative concept sorting and of external structure information moderated by perceived difficulty. In Study 1, undergraduate students completed a perceived difficulty survey and comprehension pretest, then a sorting…
Descriptors: Undergraduate Students, Comprehension, Concept Formation, Difficulty Level
Kaitlyn Tracy; Ourania Spantidi – IEEE Transactions on Learning Technologies, 2025
Virtual reality (VR) has emerged as a transformative educational tool, enabling immersive learning environments that promote student engagement and understanding of complex concepts. However, despite the growing adoption of VR in education, there remains a significant gap in research exploring how generative artificial intelligence (AI), such as…
Descriptors: Artificial Intelligence, Computer Assisted Instruction, Computer Simulation, Educational Technology
Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024
The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…
Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation
Apichat Khamboonruang – Language Testing in Asia, 2025
The Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021
The paper proposes new measures of the difficulty and discriminating values of binary items and of tests consisting of such items, and finds their relationships, including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use the entire data. Difficulty value of test and item is defined…
Descriptors: Test Items, Difficulty Level, Scores, Test Reliability
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Çinar, Murat; Dogan, Dilek; Tüzün, Hakan – International Journal of Technology and Design Education, 2022
This study aims to investigate the effects of different design tasks on the cognitive load level of instructional designers during the process of designing a learning activity in a 3D multi-user virtual environment (MUVE). The sample consisted of 16 undergraduate students who were experienced in the areas of instructional design, computer…
Descriptors: Instructional Design, Cognitive Processes, Difficulty Level, Learning Activities
Ari, Omer; Calandra, Brendan – College Teaching, 2022
College students enrolled in a reading support course were asked to (a) read a short text, (b) listen to a second text, and (c) read + listen to a third text and answer multiple-choice comprehension questions about each text. Each condition employed a self-study format allowing for constant availability of text input and extra time to revisit text…
Descriptors: College Students, Reading Comprehension, Cognitive Processes, Difficulty Level
Xavier Ochoa; Xiaomeng Huang; Yuli Shao – Journal of Learning Analytics, 2025
Generative AI (GenAI) has the potential to revolutionize the analysis of educational data, significantly impacting learning analytics (LA). This study explores the capability of non-experts, including administrators, instructors, and students, to effectively use GenAI for descriptive LA tasks without requiring specialized knowledge in data…
Descriptors: Learning Analytics, Artificial Intelligence, Computer Software, Scores
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores