Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 37 |
Descriptor
Item Response Theory | 39 |
Probability | 39 |
Statistical Analysis | 39 |
Models | 12 |
Scores | 11 |
Computation | 10 |
Foreign Countries | 9 |
Monte Carlo Methods | 6 |
Test Items | 6 |
Bayesian Statistics | 5 |
College Students | 5 |
Publication Type
Journal Articles | 31 |
Reports - Research | 27 |
Reports - Evaluative | 10 |
Speeches/Meeting Papers | 3 |
Guides - Non-Classroom | 2 |
Numerical/Quantitative Data | 2 |
Reports - Descriptive | 2 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 8 |
Elementary Education | 4 |
Postsecondary Education | 4 |
Kindergarten | 3 |
Early Childhood Education | 2 |
Junior High Schools | 2 |
High Schools | 1 |
Middle Schools | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Raykov, Tenko; Marcoulides, George A.; Pusic, Martin – Measurement: Interdisciplinary Research and Perspectives, 2021
An interval estimation procedure is discussed that can be used to evaluate the probability of a particular response for a binary or binary scored item at a pre-specified point along an underlying latent continuum. The item is assumed to: (a) be part of a unidimensional multi-component measuring instrument that may contain also polytomous items,…
Descriptors: Item Response Theory, Computation, Probability, Test Items
Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018
Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…
Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating
Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017
In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…
Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics
Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018
Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software
Herrmann-Abell, Cari F.; DeBoer, George E. – Grantee Submission, 2016
Understanding students' misconceptions and how they change is an essential part of supporting students in their science learning. This paper presents results from distractor-driven multiple-choice assessments that target students' misconceptions about energy. Over 20,000 elementary, middle and high school students from across the U.S. participated…
Descriptors: Item Response Theory, Probability, Elementary School Students, Middle School Students
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
Fiedler, Daniela; Tröbst, Steffen; Harms, Ute – CBE - Life Sciences Education, 2017
Students of all ages face severe conceptual difficulties regarding key aspects of evolution--the central, unifying, and overarching theme in biology. Aspects strongly related to abstract "threshold" concepts like randomness and probability appear to pose particular difficulties. A further problem is the lack of an appropriate instrument…
Descriptors: College Students, Concept Formation, Probability, Evolution
Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018
We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify possible sources of DIF by item type, content, and wording. DIF was…
Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory
Sinharay, Sandip; Wan, Ping; Whitaker, Mike; Kim, Dong-In; Zhang, Litong; Choi, Seung W. – Journal of Educational Measurement, 2014
With an increase in the number of online tests, interruptions during testing due to unexpected technical issues seem unavoidable. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. There is a lack of research on this…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Regression (Statistics)
Hardcastle, Joseph; Herrmann-Abell, Cari F.; DeBoer, George E. – Grantee Submission, 2017
Can student performance on computer-based tests (CBT) and paper-and-pencil tests (PPT) be considered equivalent measures of student knowledge? States and school districts are grappling with this question, and although studies addressing this question are growing, additional research is needed. We report on the performance of students who took…
Descriptors: Academic Achievement, Computer Assisted Testing, Comparative Analysis, Student Evaluation
Jones, W. Paul – Educational and Psychological Measurement, 2014
A study in a university clinic/laboratory investigated adaptive Bayesian scaling as a supplement to interpretation of scores on the Mini-IPIP. A "probability of belonging" in categories of low, medium, or high on each of the Big Five traits was calculated after each item response and continued until all items had been used or until a…
Descriptors: Personality Traits, Personality Measures, Bayesian Statistics, Clinics
Ostrow, Korinn; Donnelly, Christopher; Heffernan, Neil – International Educational Data Mining Society, 2015
As adaptive tutoring systems grow increasingly popular for the completion of classwork and homework, it is crucial to assess the manner in which students are scored within these platforms. The majority of systems, including ASSISTments, return the binary correctness of a student's first attempt at solving each problem. Yet for many teachers,…
Descriptors: Intelligent Tutoring Systems, Scoring, Testing, Credits
Timmons, Kristy; Pelletier, Janette – Early Child Development and Care, 2016
In this study, we explored the influence of kindergarten children's perspectives of school on their literacy and self-regulation outcomes. Children's early perspectives were captured in a three-question, finger-puppet interview. Responses to the interview questions were coded thematically as being academic and/or social in nature, and were…
Descriptors: Childhood Attitudes, Kindergarten, Longitudinal Studies, Puppetry
Mahmud, Zamalia; Porter, Anne – Indonesian Mathematical Society Journal on Mathematics Education, 2015
Students' understanding of probability concepts has been investigated from various perspectives. This study set out to investigate the perceived understanding of probability concepts of forty-four students from the STAT131 Understanding Uncertainty and Variation course at the University of Wollongong, NSW. Rasch measurement which is…
Descriptors: Probability, Concept Teaching, Item Response Theory, Computer Assisted Testing
Dong, Nianbo – Society for Research on Educational Effectiveness, 2012
This paper is based on previous studies in applying propensity score methods to study multiple treatment variables to examine the causal moderator effect. The propensity score methods will be demonstrated in a case study to examine the causal moderator effect, where the moderators are categorical and continuous variables. Moderation analysis is an…
Descriptors: Probability, Statistical Analysis, Case Studies, Intervention