Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 40 |
Descriptor
Models | 44 |
Probability | 44 |
Test Items | 44 |
Item Response Theory | 23 |
Psychometrics | 10 |
Classification | 9 |
Difficulty Level | 9 |
Evaluation Methods | 9 |
Computation | 8 |
Statistical Analysis | 8 |
Foreign Countries | 7 |
Publication Type
Journal Articles | 40 |
Reports - Research | 27 |
Reports - Evaluative | 9 |
Reports - Descriptive | 6 |
Speeches/Meeting Papers | 3 |
Collected Works - Proceedings | 1 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 8 |
Postsecondary Education | 5 |
Elementary Education | 4 |
Elementary Secondary Education | 3 |
Secondary Education | 3 |
Intermediate Grades | 2 |
Grade 4 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Audience
Practitioners | 1 |
Researchers | 1 |
Location
Australia | 1 |
California | 1 |
Canada | 1 |
Germany | 1 |
Indonesia | 1 |
Iran | 1 |
Israel | 1 |
Italy | 1 |
Japan | 1 |
Netherlands (Amsterdam) | 1 |
Singapore | 1 |
Assessments and Surveys
Program for International Student Assessment (PISA) | 3 |
National Assessment of Educational Progress (NAEP) | 2 |
Armed Services Vocational Aptitude Battery (ASVAB) | 1 |
Graduate Record Examinations | 1 |
SAT (College Admission Test) | 1 |
Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024
Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…
Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)
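For orientation, the "traditional approaches" referred to here treat guessing and slipping through fixed lower and upper asymptotes. A minimal statement of the standard 4PL item response function (background only, not the proficiency-related model the article proposes):

$$P(X_{ij}=1 \mid \theta_i) = c_j + (d_j - c_j)\,\frac{1}{1 + \exp\{-a_j(\theta_i - b_j)\}},$$

where $c_j$ is the guessing (lower) asymptote and $d_j$ the slipping (upper) asymptote; the 3PL model is the special case $d_j = 1$.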
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
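One widely used way to separate ERS from content-driven responding, shown here only to illustrate the general idea rather than the specific models compared in the article, is an IRTree-style decomposition: each Likert response is split into pseudo-items (e.g., direction and extremity nodes), and the extremity node is driven by a separate ERS trait $\eta_i$:

$$P(\text{extreme category chosen} \mid \text{direction node}) = \frac{1}{1 + \exp\{-(\eta_i - \delta_j)\}}.$$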
The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
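The score in question, the vector of posterior means of the skill-mastery indicators, can be computed directly once a diagnostic model and its item parameters are fixed. Below is a minimal sketch assuming a DINA-style likelihood with known guess/slip parameters and a uniform prior over attribute profiles; it illustrates the reported score only and does not implement the reliability estimators the article proposes.

```python
import numpy as np
from itertools import product

# Posterior means of skill-mastery indicators under an assumed DINA model
# with known item parameters (guess g, slip s) and a known Q-matrix.
# Illustrative sketch only; not the article's reliability measures.

def posterior_skill_means(x, Q, g, s, prior=None):
    """x: (J,) 0/1 responses; Q: (J,K) Q-matrix; g, s: (J,) item parameters."""
    J, K = Q.shape
    profiles = np.array(list(product([0, 1], repeat=K)))     # all 2^K attribute patterns
    if prior is None:
        prior = np.full(len(profiles), 1.0 / len(profiles))  # uniform prior over patterns
    # eta[c, j] = 1 if profile c possesses every attribute item j requires
    eta = np.all(profiles[:, None, :] >= Q[None, :, :], axis=2).astype(float)
    p_correct = eta * (1 - s) + (1 - eta) * g                 # P(X_j = 1 | profile)
    lik = np.prod(np.where(x == 1, p_correct, 1 - p_correct), axis=1)
    post = prior * lik
    post /= post.sum()
    return post @ profiles                                    # posterior mean of each skill indicator

# Toy example: 4 items measuring 2 skills
Q = np.array([[1, 0], [0, 1], [1, 1], [1, 0]])
g = np.full(4, 0.2)
s = np.full(4, 0.1)
print(posterior_skill_means(np.array([1, 0, 1, 1]), Q, g, s))
```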
Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021
Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…
Descriptors: Test Items, Responses, Reaction Time, Models
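The within-state response-time distribution most often assumed in this literature is van der Linden's lognormal model, named here for orientation (the paper's point is precisely what happens when such an assumption is violated):

$$f(t_{ij} \mid \tau_i) = \frac{\alpha_j}{t_{ij}\sqrt{2\pi}}\exp\!\left\{-\tfrac{1}{2}\bigl[\alpha_j\bigl(\ln t_{ij} - (\beta_j - \tau_i)\bigr)\bigr]^2\right\},$$

with item time intensity $\beta_j$, time discrimination $\alpha_j$, and person speed $\tau_i$.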
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
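The classical baseline such models extend is the knowledge-or-random-guessing decomposition: with knowledge level $k$ and $m$ options per item,

$$P(\text{correct}) = k + (1 - k)\,\frac{1}{m}.$$

One simple way to add a blunder term $b$, shown purely for illustration and not as the authors' parameterization, is $P(\text{correct}) = k(1 - b) + (1 - k)/m$, so that even a known item can be missed with probability $b$.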
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021
In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…
Descriptors: Perception, Bias, Theories, Test Items
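A generic signal-detection choice sketch of this kind of process model (illustrative; not necessarily the exact parameterization used in the article): each alternative receives a latent plausibility, the correct alternative's mean is shifted by a detection parameter $d_j$ when the item is known, and the examinee picks the maximum,

$$V_{\text{correct}} \sim N(d_j \cdot k_{ij}, 1), \quad V_{\text{distractor}} \sim N(0, 1), \quad \text{response} = \arg\max_a V_a,$$

where $k_{ij}$ indicates whether examinee $i$ knows item $j$.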
Henson, Robert; DiBello, Lou; Stout, Bill – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs, also known as cognitive diagnosis models) hold the promise of providing detailed classroom information about the skills a student has or has not mastered. Specifically, DCMs are special cases of constrained latent class models where classes are defined based on mastery/nonmastery of a set of attributes (or…
Descriptors: Classification, Diagnostic Tests, Models, Mastery Learning
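A canonical instance of such a constrained latent class model is the DINA model, stated here as a concrete example (the article treats DCMs more generally):

$$P(X_{ij} = 1 \mid \boldsymbol{\alpha}_i) = (1 - s_j)^{\eta_{ij}}\, g_j^{\,1 - \eta_{ij}}, \qquad \eta_{ij} = \prod_{k} \alpha_{ik}^{\,q_{jk}},$$

where $\eta_{ij}$ indicates whether examinee $i$ has mastered every attribute required by item $j$, $s_j$ is the slip probability, and $g_j$ the guess probability.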
Lee, Hollylynne; Bradshaw, Laine; Famularo, Lisa; Masters, Jessica; Azevedo, Roger; Johnson, Sheri; Schellman, Madeline; Elrod, Emily; Sanei, Hamid – Grantee Submission, 2019
The research shared in this conference paper illustrates how an iterative item-development process involving expert review and cognitive lab interviews with students can be used to collect validity evidence for assessment items. Analysis of students' reasoning was also used to expand a model for identifying conceptions and…
Descriptors: Middle School Students, Interviews, Misconceptions, Test Items
Chen, Binglin; West, Matthew; Zilles, Craig – International Educational Data Mining Society, 2018
This paper attempts to quantify the accuracy limit of "next-item-correct" prediction by using numerical optimization to estimate the student's probability of getting each question correct given a complete sequence of item responses. This optimization is performed without an explicit parameterized model of student behavior, but with the…
Descriptors: Accuracy, Probability, Student Behavior, Test Items
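The accuracy limit being estimated has a compact statement: if a student's true probability of answering the next item correctly is $p$, the best any predictor can do is predict the more likely outcome, so

$$\text{accuracy} \le \mathbb{E}\bigl[\max(p,\, 1 - p)\bigr],$$

and the optimization is aimed at estimating those $p$ values without a parametric student model (the bound above is a restatement of the idea, not the paper's estimation procedure).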
DeMars, Christine E. – Educational and Psychological Measurement, 2016
Partially compensatory models may capture the cognitive skills needed to answer test items more realistically than compensatory models, but estimating the model parameters may be a challenge. Data were simulated to follow two different partially compensatory models, a model with an interaction term and a product model. The model parameters were…
Descriptors: Item Response Theory, Models, Thinking Skills, Test Items
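For reference, the product (partially compensatory) form multiplies per-dimension success probabilities, so a deficit on one skill cannot be fully offset by strength on another. A standard statement of it, given for orientation only since the simulated models in the study may differ in detail, is

$$P(X_{ij}=1 \mid \boldsymbol{\theta}_i) = c_j + (1 - c_j)\prod_{k=1}^{K} \frac{1}{1 + \exp\{-a_{jk}(\theta_{ik} - b_{jk})\}}.$$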
Andersson, Björn – Journal of Educational Measurement, 2016
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Descriptors: Equated Scores, Item Response Theory, Error of Measurement, Tests
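The equating function this describes is the composition of one score distribution with the inverse of the other:

$$e_Y(x) = F_Y^{-1}\bigl(F_X(x)\bigr),$$

where $F_X$ and $F_Y$ are the cumulative distributions of observed scores on the two tests; in IRT observed-score equating these distributions are derived from the fitted polytomous item response model rather than taken directly from the sample.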
Shulruf, Boaz; Jones, Phil; Turner, Rolf – Higher Education Studies, 2015
The determination of pass/fail decisions over borderline grades (i.e., grades that do not clearly distinguish between competent and incompetent examinees) has been an ongoing challenge for academic institutions. This study utilises the Objective Borderline Method (OBM) to determine examinee ability and item difficulty, and from that…
Descriptors: Undergraduate Students, Pass Fail Grading, Decision Making, Probability
Raykov, Tenko; Marcoulides, George A.; Lee, Chun-Lung; Chang, Chi – Educational and Psychological Measurement, 2013
This note is concerned with a latent variable modeling approach for the study of differential item functioning in a multigroup setting. A multiple-testing procedure that can be used to evaluate group differences in response probabilities on individual items is discussed. The method is readily employed when the aim is also to locate possible…
Descriptors: Test Bias, Statistical Analysis, Models, Hypothesis Testing
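A naive per-item illustration of testing many group differences in response probabilities with a multiplicity correction is sketched below. This is not the latent-variable procedure the note develops, and it ignores ability differences; it only shows the multiple-testing step, using made-up data.

```python
import numpy as np
from statsmodels.stats.proportion import proportions_ztest
from statsmodels.stats.multitest import multipletests

# Compare correct-response proportions between two groups item by item,
# then correct the item-level p-values for multiple testing.
rng = np.random.default_rng(0)
n_items, n_ref, n_focal = 20, 500, 500
ref = rng.binomial(1, 0.7, size=(n_ref, n_items))     # simulated reference-group responses
focal = rng.binomial(1, 0.7, size=(n_focal, n_items)) # simulated focal-group responses

pvals = []
for j in range(n_items):
    count = np.array([ref[:, j].sum(), focal[:, j].sum()])
    nobs = np.array([n_ref, n_focal])
    _, p = proportions_ztest(count, nobs)              # two-sample proportion z-test for item j
    pvals.append(p)

# Control the family-wise error rate across the item-level tests
reject, p_adj, _, _ = multipletests(pvals, alpha=0.05, method="holm")
print("Items flagged for group differences:", np.where(reject)[0])
```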
van der Linden, Wim J.; Jeon, Minjeong – Journal of Educational and Behavioral Statistics, 2012
The probability of test takers changing answers upon review of their initial choices is modeled. The primary purpose of the model is to check erasures on answer sheets recorded by an optical scanner for numbers and patterns that may be indicative of irregular behavior, such as teachers or school administrators changing answer sheets after their…
Descriptors: Probability, Models, Test Items, Educational Testing
Andjelic, Svetlana; Cekerevac, Zoran – Education and Information Technologies, 2014
This article presents an original model of computer adaptive testing and grade formation, based on scientifically recognized theories. At the core of the model is a personalized algorithm that selects questions depending on whether the previous question was answered correctly. The test is divided into three basic levels of difficulty, and the…
Descriptors: Computer Assisted Testing, Educational Technology, Grades (Scholastic), Test Construction
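A minimal sketch of a three-level adaptive rule of the kind the abstract describes, where a correct answer moves the examinee to a harder level and an incorrect answer to an easier one. This is illustrative only; the published algorithm and its grade-formation rules are more elaborate, and all names below are hypothetical.

```python
import random

# Three-difficulty-level adaptive selection: move up a level after a correct
# answer, down after an incorrect one. Illustrative sketch, not the article's
# exact algorithm or grading scheme.
LEVELS = {1: "easy", 2: "medium", 3: "hard"}

def run_adaptive_test(item_bank, answer_fn, n_items=10, start_level=2):
    """item_bank: {level: [items]}; answer_fn(item) -> True if answered correctly."""
    level, administered, score = start_level, [], 0
    for _ in range(n_items):
        item = random.choice(item_bank[level])
        correct = answer_fn(item)
        administered.append((item, LEVELS[level], correct))
        score += level if correct else 0                  # harder items weigh more toward the grade
        level = min(3, level + 1) if correct else max(1, level - 1)
    return score, administered

# Toy usage: a simulated examinee who answers easier items more reliably
bank = {1: [f"E{i}" for i in range(20)], 2: [f"M{i}" for i in range(20)], 3: [f"H{i}" for i in range(20)]}
simulee = lambda item: random.random() < {"E": 0.9, "M": 0.6, "H": 0.3}[item[0]]
print(run_adaptive_test(bank, simulee))
```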