Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 18 |
| Since 2007 (last 20 years) | 31 |
Descriptor
| Accuracy | 32 |
| Difficulty Level | 32 |
| Models | 32 |
| Item Response Theory | 12 |
| Test Items | 9 |
| Comparative Analysis | 7 |
| Foreign Countries | 7 |
| Undergraduate Students | 7 |
| Prediction | 6 |
| Second Language Learning | 6 |
| Task Analysis | 6 |
Author
| Abdi Tabari, Mahmoud | 1 |
| Aghekyan, Rosa | 1 |
| Aiman Mohammad Freihat | 1 |
| Ashford-Rowe, Kevin | 1 |
| Azzam, Tarek | 1 |
| Barnes, Tiffany | 1 |
| Bradshaw, Laine P. | 1 |
| Brown, Christine | 1 |
| Chen, Binglin | 1 |
| Chi, Min | 1 |
| Christina Glasauer | 1 |
Publication Type
| Reports - Research | 30 |
| Journal Articles | 27 |
| Speeches/Meeting Papers | 3 |
| Dissertations/Theses -… | 1 |
| Reports - Descriptive | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 13 |
| Postsecondary Education | 10 |
| Elementary Education | 1 |
| Grade 5 | 1 |
| Grade 6 | 1 |
| Grade 9 | 1 |
| Intermediate Grades | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
| Teachers | 2 |
Assessments and Surveys
| Graduate Record Examinations | 1 |
| SAT (College Admission Test) | 1 |
| Test of English as a Foreign… | 1 |
Aiman Mohammad Freihat; Omar Saleh Bani Yassin – Educational Process: International Journal, 2025
Background/purpose: This study aimed to reveal the accuracy of estimation of multiple-choice test items parameters following the models of the item-response theory in measurement. Materials/methods: The researchers depended on the measurement accuracy indicators, which express the absolute difference between the estimated and actual values of the…
Descriptors: Accuracy, Computation, Multiple Choice Tests, Test Items
Christina Glasauer; Martin K. Yeh; Lois Anne DeLong; Yu Yan; Yanyan Zhuang – Computer Science Education, 2025
Background and Context: Feedback on one's progress is essential to new programming language learners, particularly in out-of-classroom settings. Though many study materials offer assessment mechanisms, most do not examine the accuracy of the feedback they deliver, nor give evidence on its validity. Objective: We investigate the potential use of a…
Descriptors: Novices, Computer Science Education, Programming, Accuracy
Wang, Jue; Engelhard, George; Combs, Trenton – Journal of Experimental Education, 2023
Unfolding models are frequently used to develop scales for measuring attitudes. Recently, unfolding models have been applied to examine rater severity and accuracy within the context of rater-mediated assessments. One of the problems in applying unfolding models to rater-mediated assessments is that the substantive interpretations of the latent…
Descriptors: Writing Evaluation, Scoring, Accuracy, Computational Linguistics
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
Jones, Natalie D.; Azzam, Tarek; Wanzer, Dana Linnell; Skousen, Darrel; Knight, Ciara; Sabarre, Nina – American Journal of Evaluation, 2020
One of the most widely used communication tools in evaluation is the logic model. Despite its extensive use, there has been little research into the visualization aspect of the logic model. To assess the impact that design modifications would have on its effectiveness, we applied established visualization principles to revise a program model.…
Descriptors: Logical Thinking, Models, Visualization, Accuracy
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
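The three-parameter logistic (3PL) model named in the abstract above can be illustrated with a minimal sketch. This is the standard textbook form of the model, not code from the cited article; the function name `p_correct_3pl` and the example values are assumptions chosen for illustration only.

```python
import math


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the standard 3PL model.

    theta: person ability parameter
    a:     item discrimination
    b:     item difficulty
    c:     pseudo-chance (lower asymptote)
    All names here are illustrative, not taken from the cited study.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: when ability equals difficulty (theta == b),
# the probability sits halfway between the pseudo-chance level c and 1.
print(p_correct_3pl(theta=0.0, a=1.2, b=0.0, c=0.2))
```

Because the response probability depends on all four quantities at once, error in any one estimated item parameter carries through to the predicted probabilities, which is why the accuracy of these estimates matters.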
Morsy, Sara; Karypis, George – Journal of Educational Data Mining, 2019
In order to help undergraduate students towards successfully completing their degrees, developing tools that can assist students during the course selection process is a significant task in the education domain. The optimal set of courses for each student should include courses that help him/her graduate in a timely fashion and for which he/she is…
Descriptors: Undergraduate Students, Grade Point Average, Course Selection (Students), Prediction
Kitikanan, Patchanok – English Language Teaching, 2022
This article reports on the second language (L2) perception of contrasts among British English monophthongs. This study has two aims: 1) to explore the discriminability of contrasts in L2 British English monophthongs; and 2) to test the perceptual assimilation model-L2 (PAM-L2) towards the ability to discriminate British English contrasts. The…
Descriptors: Language Variation, Vowels, Second Language Learning, English (Second Language)
Rybinski, Krzysztof; Kopciuszewska, Elzbieta – Assessment & Evaluation in Higher Education, 2021
This article presents the first-ever big data study of the student evaluation of teaching (SET) using artificial intelligence (AI). We train natural language processing (NLP) models on 1.6 million student evaluations from the US and the UK. We address two research questions: (1) are these models able to predict student ratings from the student…
Descriptors: Artificial Intelligence, Technology Uses in Education, Student Evaluation of Teacher Performance, Natural Language Processing
Bradshaw, Laine P.; Madison, Matthew J. – International Journal of Testing, 2016
In item response theory (IRT), the invariance property states that item parameter estimates are independent of the examinee sample, and examinee ability estimates are independent of the test items. While this property has long been established and understood by the measurement community for IRT models, the same cannot be said for diagnostic…
Descriptors: Classification, Models, Simulation, Psychometrics
Nurnberger-Haag, Julie – Research in Mathematics Education, 2018
Practicing teachers as well as researchers, mathematicians, and teacher educators have offered opinions and theoretical critiques of the multiple models used to teach integer arithmetic. Few studies, however, have investigated what students learn with models or empirically compared affordances and constraints of integer models. This led me to…
Descriptors: Subtraction, Mathematics Instruction, Teaching Methods, Criticism
Chen, Binglin; West, Matthew; Zilles, Craig – International Educational Data Mining Society, 2018
This paper attempts to quantify the accuracy limit of "next-item-correct" prediction by using numerical optimization to estimate the student's probability of getting each question correct given a complete sequence of item responses. This optimization is performed without an explicit parameterized model of student behavior, but with the…
Descriptors: Accuracy, Probability, Student Behavior, Test Items
Malicka, Aleksandra – Language Teaching Research, 2020
This study set out to test the theoretical premise of the SSARC model of pedagogic task sequencing, which postulates that tasks should be sequenced for learners from cognitively simple to complex. This experiment compared the performance of three tasks differing in cognitive complexity in a simple-complex sequence versus in the absence of any…
Descriptors: Accuracy, Language Fluency, Teaching Methods, Second Language Learning
Aghekyan, Rosa – Journal of College Science Teaching, 2018
Inclusion of model building in the learning process promotes a deep understanding of scientific content and concepts. Likewise, higher level questioning requires high cognitive demand and critical reasoning and leads to positive educational results. The purpose of this study was to determine whether the conjunction of higher level thinking…
Descriptors: Thinking Skills, Critical Thinking, Logical Thinking, Scientific Concepts
Martin-Fernandez, Manuel; Revuelta, Javier – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robbins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…
Descriptors: Bayesian Statistics, Item Response Theory, Models, Comparative Analysis

