Publication Date
In 2025 | 2 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 27 |
Since 2016 (last 10 years) | 66 |
Since 2006 (last 20 years) | 110 |
Descriptor
Difficulty Level | 181 |
Simulation | 181 |
Test Items | 90 |
Item Response Theory | 53 |
Comparative Analysis | 28 |
Cognitive Processes | 25 |
Computer Assisted Testing | 24 |
Models | 24 |
Statistical Analysis | 24 |
Item Analysis | 23 |
Sample Size | 23 |
More ▼ |
Source
Author
Schnipke, Deborah L. | 5 |
Jin, Kuan-Yu | 3 |
Sinharay, Sandip | 3 |
Wang, Wen-Chung | 3 |
Guo, Hongwen | 2 |
Holland, Paul | 2 |
Kamata, Akihito | 2 |
Li, Yuan H. | 2 |
Mazor, Kathleen M. | 2 |
Reckase, Mark D. | 2 |
Reese, Lynda M. | 2 |
More ▼ |
Publication Type
Education Level
Higher Education | 26 |
Postsecondary Education | 20 |
Secondary Education | 5 |
Adult Education | 2 |
Two Year Colleges | 2 |
Elementary Education | 1 |
Grade 9 | 1 |
Audience
Practitioners | 4 |
Researchers | 4 |
Teachers | 3 |
Location
China | 4 |
South Korea | 2 |
United Kingdom | 2 |
Asia | 1 |
Australia | 1 |
California (Berkeley) | 1 |
Colorado | 1 |
Germany | 1 |
Italy | 1 |
Japan | 1 |
Minnesota | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Xiangyi Liao; Daniel M. Bolt; Jee-Seon Kim – Journal of Educational Measurement, 2024
Item difficulty and dimensionality often correlate, implying that unidimensional IRT approximations to multidimensional data (i.e., reference composites) can take a curvilinear form in the multidimensional space. Although this issue has been previously discussed in the context of vertical scaling applications, we illustrate how such a phenomenon…
Descriptors: Difficulty Level, Simulation, Multidimensional Scaling, Graphs
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
Barbrook-Johnson, Pete; Carrick, Jayne – International Journal of Social Research Methodology, 2022
Research methods with roots in complexity science are increasingly popular in social research. However, they are not widespread and have potential to deliver value more fully and consistently to social research and methodology. One reason for this is that methods are often used alone, or only with traditional social research methods. We attempt to…
Descriptors: Social Science Research, Research Methodology, Difficulty Level, Simulation
Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022
The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…
Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level
Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023
In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…
Descriptors: Goodness of Fit, Responses, Likert Scales, Models
Deghedi, Ghada Ahmed – International Journal of Game-Based Learning, 2023
Game-based learning has proven to be an effective teaching method in a variety of fields, including supply chain management (SCM). This paper discusses the use of simulation-based business games for training and education in SCM by evaluating the complexity of SCM games. The main contributions of this study are twofold. (1) Creation of a database…
Descriptors: Game Based Learning, Supply and Demand, Business Administration Education, Difficulty Level
Barbara J. Kinney – ProQuest LLC, 2024
This qualitative descriptive study investigated the experiences and professional outcomes of novice registered nurses participating in professional development nursing simulations integrating worked examples, a cognitive load instructional design principle. Drawing upon cognitive load theory and the worked examples principle, the study addressed…
Descriptors: Nurses, Nursing Education, Simulation, Professional Development
Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024
The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Postratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…
Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests
Yang, Eunbyul; Park, Sanghoon; Ryu, Jeeheon; Lim, Taehyeong – Journal of Computing in Higher Education, 2023
The purposes of this study were (1) to introduce a virtual dental lab designed to support students' virtual clinical examinations in a dentistry program in South Korea and (2) to determine how dental students' levels of expertise (low, medium, or high) influence their clinical performance in terms of dwell time on each tooth location, total…
Descriptors: Foreign Countries, Dentistry, Dental Schools, Laboratories
Lion Sieg; Hendrik Eismann; Bertrand Schneider; Jan Karsten; Dogus Darici – Journal of Computer Assisted Learning, 2025
Background: Research in social cognition suggests that learning effectiveness in teacher-learner pairs may be influenced by how well their attention aligns with each other. However, we currently have limited understanding of how common distractions in real-world environments affect teacher-student interactions, specifically the synchronisation of…
Descriptors: Attention Control, Social Cognition, Teacher Student Relationship, Interaction
Wyse, Adam E.; McBride, James R. – Journal of Educational Measurement, 2021
A key consideration when giving any computerized adaptive test (CAT) is how much adaptation is present when the test is used in practice. This study introduces a new framework to measure the amount of adaptation of Rasch-based CATs based on looking at the differences between the selected item locations (Rasch item difficulty parameters) of the…
Descriptors: Item Response Theory, Computer Assisted Testing, Adaptive Testing, Test Items
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
Boehm, Udo; Matzke, Dora; Gretton, Matthew; Castro, Spencer; Cooper, Joel; Skinner, Michael; Strayer, David; Heathcote, Andrew – Cognitive Research: Principles and Implications, 2021
Human operators often experience large fluctuations in cognitive workload over seconds timescales that can lead to sub-optimal performance, ranging from overload to neglect. Adaptive automation could potentially address this issue, but to do so it needs to be aware of real-time changes in operators' spare cognitive capacity, so it can provide help…
Descriptors: Prediction, Cognitive Processes, Difficulty Level, Automation
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level