Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods, including number-correct, IRT theta, and hybrid scoring, in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
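As a hedged sketch of two of the scoring methods compared in this study (not the authors' code), number-correct and IRT theta scoring can be contrasted under a Rasch model, with the theta estimate taken as an EAP over a standard-normal prior; the items and responses below are hypothetical:

```python
import math

def rasch_p(theta, b):
    """Rasch probability of a correct response given ability theta and difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def number_correct(responses):
    """Number-correct scoring: a simple sum of 0/1 item responses."""
    return sum(responses)

def eap_theta(responses, difficulties):
    """EAP estimate of theta under a standard-normal prior, by grid quadrature."""
    grid = [-4 + 0.1 * i for i in range(81)]
    post = []
    for t in grid:
        weight = math.exp(-0.5 * t * t)  # unnormalized N(0, 1) prior
        for x, b in zip(responses, difficulties):
            p = rasch_p(t, b)
            weight *= p if x == 1 else (1 - p)
        post.append(weight)
    return sum(t * w for t, w in zip(grid, post)) / sum(post)

responses = [1, 1, 0, 1, 0]
difficulties = [-1.0, -0.5, 0.0, 0.5, 1.0]
print(number_correct(responses))  # 3
print(round(eap_theta(responses, difficulties), 3))
```

Unlike the number-correct score, the theta estimate depends on *which* items were answered correctly, which is one reason the two methods can behave differently over a chain of equatings.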
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
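The knowledge-plus-guessing decomposition this abstract describes can be illustrated with a deliberately simple model (an assumption for illustration, not the authors' fitted model): an examinee knows the answer with probability `know`, and otherwise guesses uniformly among `m` options.

```python
def p_correct(know, m):
    """P(correct) when the examinee knows the answer with probability `know`,
    and otherwise guesses uniformly among m options."""
    return know + (1 - know) / m

def p_knew_given_correct(know, m):
    """Posterior probability that knowledge, not a lucky guess,
    produced an observed correct answer (Bayes' rule)."""
    return know / p_correct(know, m)

print(round(p_correct(0.6, 4), 2))             # 0.7
print(round(p_knew_given_correct(0.6, 4), 3))  # 0.857
```

Even this toy version shows why raw percent-correct overstates knowledge: a 70% observed score here corresponds to knowing only 60% of the material.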
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
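A minimal sketch of the CPT-fitting problem (illustrative only; it does not reproduce the dissertation's method): with complete observations of a parent state and a child response, each conditional probability can be estimated by smoothed counting, i.e., under a Beta prior on the CPT parameter.

```python
from collections import Counter

def fit_cpt(observations, parent_values, alpha=1.0):
    """Estimate P(child=1 | parent) from (parent, child) pairs with a
    Beta(alpha, alpha) prior on each entry (Laplace smoothing at alpha=1)."""
    counts = Counter(observations)  # keys are (parent_value, child_value)
    cpt = {}
    for pv in parent_values:
        ones = counts[(pv, 1)]
        zeros = counts[(pv, 0)]
        cpt[pv] = (ones + alpha) / (ones + zeros + 2 * alpha)
    return cpt

# Hypothetical data: masters usually answer correctly, novices usually do not.
data = ([("master", 1)] * 8 + [("master", 0)] * 2
        + [("novice", 1)] * 3 + [("novice", 0)] * 7)
cpt = fit_cpt(data, ["master", "novice"])
print(cpt)  # {'master': 0.75, 'novice': 0.333...}
```

Real networks with latent (unobserved) parents need iterative methods such as EM, which is exactly where the fitting challenge mentioned above arises.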
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2018
This study examined the use of Bayesian analysis methods for the estimation of item parameters in a two-parameter logistic item response theory model. Using simulated data under various design conditions with both informative and non-informative priors, the parameter recovery of Bayesian analysis methods was examined. Overall results showed that…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Difficulty Level
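The effect of informative versus non-informative priors can be sketched with a toy grid-search MAP estimate of a 2PL difficulty parameter (a simplified stand-in for the full Bayesian estimation the study examines; the abilities, responses, and grid are hypothetical):

```python
import math

def p2pl(theta, a, b):
    """Two-parameter logistic item response function."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def map_difficulty(thetas, responses, a=1.0, prior_sd=None):
    """Grid-search MAP estimate of difficulty b. prior_sd=None means a flat
    (non-informative) prior; otherwise b ~ N(0, prior_sd**2)."""
    best_b, best_lp = None, -math.inf
    for i in range(121):
        b = -3 + 0.05 * i
        lp = 0.0 if prior_sd is None else -0.5 * (b / prior_sd) ** 2
        for t, x in zip(thetas, responses):
            p = p2pl(t, a, b)
            lp += math.log(p if x == 1 else 1 - p)
        if lp > best_lp:
            best_b, best_lp = b, lp
    return best_b

thetas = [-1.0, 0.0, 1.0, 2.0]
xs = [0, 0, 0, 1]  # only the strongest examinee answers correctly
flat = map_difficulty(thetas, xs)
informative = map_difficulty(thetas, xs, prior_sd=0.5)
print(flat, informative)  # the informative prior shrinks b toward 0
```

With only four responses, the data barely pin down b, so the informative prior dominates; this shrinkage with sparse data is the typical trade-off such simulation studies quantify.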
Ting, Mu Yu – EURASIA Journal of Mathematics, Science & Technology Education, 2017
Using the capabilities of expert knowledge structures, the researcher prepared test questions on the university calculus topic of "finding the area by integration." The quiz is divided into two types of multiple choice items (one out of four and one out of many). After the calculus course was taught and tested, the results revealed that…
Descriptors: Calculus, Mathematics Instruction, College Mathematics, Multiple Choice Tests
Dardick, William R.; Mislevy, Robert J. – Educational and Psychological Measurement, 2016
A new variant of the iterative "data = fit + residual" data-analytical approach described by Mosteller and Tukey is proposed and implemented in the context of item response theory psychometric models. Posterior probabilities from a Bayesian mixture model of a Rasch item response theory model and an unscalable latent class are expressed…
Descriptors: Bayesian Statistics, Probability, Data Analysis, Item Response Theory
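A heavily simplified sketch of the mixture idea (assuming a known theta and modeling the unscalable class as constant-rate responding, which is cruder than the paper's actual model): the posterior probability of the unscalable class serves as a person-misfit index.

```python
import math

def rasch_p(theta, b):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def p_unscalable(responses, difficulties, theta, pi_u=0.1, guess=0.5):
    """Posterior probability that a response pattern came from the unscalable
    class rather than the Rasch class, treating theta as known and the
    unscalable class as coin flips at rate `guess`."""
    like_r = 1.0
    for x, b in zip(responses, difficulties):
        p = rasch_p(theta, b)
        like_r *= p if x == 1 else 1 - p
    like_u = 1.0
    for x in responses:
        like_u *= guess if x == 1 else 1 - guess
    return pi_u * like_u / (pi_u * like_u + (1 - pi_u) * like_r)

bs = [-2, -1, 0, 1, 2]
print(round(p_unscalable([1, 1, 1, 0, 0], bs, theta=0.0), 3))  # consistent: low
print(round(p_unscalable([0, 0, 0, 1, 1], bs, theta=0.0), 3))  # aberrant: high
```

A Guttman-consistent pattern (easy items right, hard items wrong) is well explained by the Rasch class, while the reversed pattern is flagged as likely unscalable.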
Lee, Jihyun; Corter, James E. – Applied Psychological Measurement, 2011
Diagnosis of misconceptions or "bugs" in procedural skills is difficult because of their unstable nature. This study addresses this problem by proposing and evaluating a probability-based approach to the diagnosis of bugs in children's multicolumn subtraction performance using Bayesian networks. This approach assumes a causal network relating…
Descriptors: Misconceptions, Probability, Children, Subtraction
Rudner, Lawrence M. – Practical Assessment, Research & Evaluation, 2009
This paper describes and evaluates the use of measurement decision theory (MDT) to classify examinees based on their item response patterns. The model has a simple framework that starts with the conditional probabilities of examinees in each category or mastery state responding correctly to each item. The presented evaluation investigates: (1) the…
Descriptors: Classification, Scoring, Item Response Theory, Measurement
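The core MDT computation, posteriors over mastery states from per-state conditional probabilities of a correct response, can be sketched directly (the states, probabilities, and priors below are hypothetical, not values from the paper):

```python
def mdt_posteriors(responses, p_correct, priors):
    """Posterior P(state | response pattern), where p_correct[state][i] is the
    probability that an examinee in `state` answers item i correctly."""
    joint = {}
    for state, prior in priors.items():
        like = prior
        for x, p in zip(responses, p_correct[state]):
            like *= p if x == 1 else (1 - p)
        joint[state] = like
    total = sum(joint.values())
    return {s: v / total for s, v in joint.items()}

p_correct = {"master": [0.9, 0.8, 0.85], "nonmaster": [0.3, 0.4, 0.35]}
priors = {"master": 0.5, "nonmaster": 0.5}
post = mdt_posteriors([1, 1, 0], p_correct, priors)
print(max(post, key=post.get))  # master
```

Classification by maximum posterior is the simplest decision rule; MDT also supports loss-weighted rules and sequential testing built on the same posteriors.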
Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M. – Journal of Educational and Behavioral Statistics, 2003
Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…
Descriptors: Test Items, Markov Processes, Educational Testing, Probability
Mislevy, Robert J.; Wilson, Mark – 1992
Standard item response theory (IRT) models posit latent variables to account for regularities in students' performance on test items. They can accommodate learning only if the expected changes in performance are smooth, and, in an appropriate metric, uniform over items. Wilson's "Saltus" model extends the ideas of IRT to development that…
Descriptors: Bayesian Statistics, Change, Development, Item Response Theory

Jensema, Carl J. – Applied Psychological Measurement, 1977
Owen's Bayesian tailored testing method is introduced along with a brief review of its derivation. The characteristics of a good item bank are outlined and explored in terms of their influence on the Bayesian tailoring process. (Author/RC)
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Oriented Programs
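Owen's closed-form normal-approximation update is not reproduced here; the following grid-based sketch only illustrates the general tailored-testing loop it belongs to (prior over ability, administer the item nearest the current estimate, update the posterior), with a hypothetical item bank of Rasch difficulties:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def adaptive_test(item_bank, answer_fn, n_items=3):
    """Grid-posterior stand-in for Bayesian tailored testing: start from a
    N(0, 1) prior over theta, repeatedly give the unused item whose difficulty
    is closest to the current posterior mean, and update on the response."""
    grid = [-4 + 0.1 * i for i in range(81)]
    post = [math.exp(-0.5 * t * t) for t in grid]
    remaining = list(item_bank)
    for _ in range(n_items):
        mean = sum(t * w for t, w in zip(grid, post)) / sum(post)
        b = min(remaining, key=lambda d: abs(d - mean))
        remaining.remove(b)
        x = answer_fn(b)  # 1 if answered correctly, else 0
        post = [w * (sigmoid(t - b) if x == 1 else 1 - sigmoid(t - b))
                for t, w in zip(grid, post)]
    return sum(t * w for t, w in zip(grid, post)) / sum(post)

# A hypothetical always-correct examinee drives the estimate upward.
print(round(adaptive_test([-2, -1, 0, 1, 2], lambda b: 1), 2))
```

This also shows why item-bank characteristics matter: the tailoring step can only select difficulties the bank actually contains.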
Abdel-fattah, Abdel-fattah A. – 1992
A scaling procedure based on item response theory (IRT) is proposed to fit non-hierarchical test structures as well. The binary scores of a test of English were used for calculating the probabilities of answering each item correctly. The probability matrix was factor analyzed, and the difficulty intervals or estimates corresponding to the factors…
Descriptors: Bayesian Statistics, Difficulty Level, English, Estimation (Mathematics)
Hambleton, Ronald K.; And Others – 1977
Latent trait theory supposes that, in testing situations, examinee performance on a test can be predicted (or explained) by defining examinee characteristics, referred to as traits, estimating scores for examinees on these traits, and using the scores to predict or explain test performance (Lord and Novick, 1968). In view of the breakthroughs in…
Descriptors: Adaptive Testing, Bayesian Statistics, Cognitive Measurement, Computer Programs