ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	15
Since 2006 (last 20 years)	18

Descriptor

Bayesian Statistics	21
Foreign Countries	21
Test Items	21
Item Response Theory	13
Achievement Tests	11
International Assessment	9
Mathematics Tests	8
Models	7
Secondary School Students	7
Computation	5
Elementary Secondary Education	5
Item Analysis	5
Comparative Analysis	4
Markov Processes	4
Mathematics Achievement	4
Monte Carlo Methods	4
Responses	4
Science Achievement	4
Science Tests	4
Scores	4
Simulation	4
Test Bias	4
Accuracy	3
Computer Assisted Testing	3
Correlation	3
More ▼

Source

Educational and Psychological…	6
Assessment & Evaluation in…	2
Grantee Submission	2
Applied Measurement in…	1
ETS Research Report Series	1
Educational Technology &…	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Learning Analytics	1
More ▼

Publication Type

Journal Articles	16
Reports - Research	16
Reports - Evaluative	3
Information Analyses	1
Opinion Papers	1
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Secondary Education	9
Higher Education	4
Postsecondary Education	4
Elementary Secondary Education	3
Junior High Schools	3
Middle Schools	3
Elementary Education	2
Grade 8	2
High Schools	2
Intermediate Grades	2
Grade 4	1
Grade 5	1
Grade 9	1
More ▼

Audience

Researchers

Location

Taiwan	3
Canada	2
Germany	2
Nigeria	2
Saudi Arabia	2
Africa	1
Botswana	1
Chile	1
Georgia Republic	1
Germany (Berlin)	1
Ghana	1
Malaysia	1
Netherlands	1
Norway	1
Philippines	1
Poland	1
Russia	1
Singapore	1
South Africa	1
Switzerland	1
Thailand	1
Turkey	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	5
Trends in International…	4
Graduate Record Examinations	1
Progress in International…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Learning to Love LLMs for Answer Interpretation: Chain-of-Thought Prompting and the AMMORE Dataset

Peer reviewed
PDF on ERIC

Download full text

Owen Henkel; Hannah Horne-Robinson; Maria Dyshel; Greg Thompson; Ralph Abboud; Nabil Al Nahin Ch; Baptiste Moreau-Pernet; Kirk Vanacore – Journal of Learning Analytics, 2025

This paper introduces AMMORE, a new dataset of 53,000 math open-response question-answer pairs from Rori, a mathematics learning platform used by middle and high school students in several African countries. Using this dataset, we conducted two experiments to evaluate the use of large language models (LLM) for grading particularly challenging…

Descriptors: Learning Analytics, Learning Management Systems, Mathematics Instruction, Middle School Students

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Direct link

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

A Mixture IRTree Model for Performance Decline and Nonignorable Missing Data

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2020

In educational assessments and achievement tests, test developers and administrators commonly assume that test-takers attempt all test items with full effort and leave no blank responses with unplanned missing values. However, aberrant response behavior--such as performance decline, dropping out beyond a certain point, and skipping certain items…

Descriptors: Item Response Theory, Response Style (Tests), Test Items, Statistical Analysis

Dimensionality Assessment of Binary Response Test Items: A Non-Parametric Approach of Bayesian Item Response Theory Measurement

Peer reviewed
PDF on ERIC

Download full text

Ayanwale, Musa Adekunle; Isaac-Oloniyo, Flourish O.; Abayomi, Funmilayo R. – International Journal of Evaluation and Research in Education, 2020

This study investigated dimensionality of Binary Response Items through a non-parametric technique of Item Response Theory measurement framework. The study used causal comparative research type of nonexperimental design. The sample consisted of 5,076 public senior secondary school examinees (SSS3) between the age of 14-16 years from 45 schools,…

Descriptors: Test Items, Item Response Theory, Bayesian Statistics, Nonparametric Statistics

A Bayesian Item Response Model for Examining Item Position Effects in Complex Survey Data

Peer reviewed

Direct link

Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021

A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…

Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format

A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023

Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…

Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy

A Response Time Process Model for Not-Reached and Omitted Items

Peer reviewed

Direct link

Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020

Item nonresponses are prevalent in standardized testing. They happen either when students fail to reach the end of a test due to a time limit or quitting, or when students choose to omit some items strategically. Oftentimes, item nonresponses are nonrandom, and hence, the missing data mechanism needs to be properly modeled. In this paper, we…

Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses

Comparison of Confirmatory Factor Analysis Estimation Methods on Mixed-Format Data

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021

Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…

Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics

A Mixture IRTree Model for Extreme Response Style: Accounting for Response Process Uncertainty

Peer reviewed

Direct link

Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021

This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…

Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items

Accounting for Differential Item Functioning Using Bayesian Approximate Measurement Invariance

Peer reviewed

Direct link

Sideridis, Georgios D.; Tsaousis, Ioannis; Alamri, Abeer A. – Educational and Psychological Measurement, 2020

The main thesis of the present study is to use the Bayesian structural equation modeling (BSEM) methodology of establishing approximate measurement invariance (A-MI) using data from a national examination in Saudi Arabia as an alternative to not meeting strong invariance criteria. Instead, we illustrate how to account for the absence of…

Descriptors: Bayesian Statistics, Structural Equation Models, Foreign Countries, Error of Measurement

A Short Note on Obtaining Point Estimates of the IRT Ability Parameter with MCMC Estimation in Mplus: How Many Plausible Values Are Needed?

Peer reviewed

Direct link

Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019

Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…

Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures

A Bayesian Beta-Mixture Model for Nonparametric IRT (BBM-IRT)

Peer reviewed
PDF on ERIC

Download full text

Arenson, Ethan A.; Karabatsos, George – Grantee Submission, 2017

Item response models typically assume that the item characteristic (step) curves follow a logistic or normal cumulative distribution function, which are strictly monotone functions of person test ability. Such assumptions can be overly-restrictive for real item response data. We propose a simple and more flexible Bayesian nonparametric IRT model…

Descriptors: Bayesian Statistics, Item Response Theory, Nonparametric Statistics, Models

Dealing with Omitted and Not-Reached Items in Competence Tests: Evaluating Approaches Accounting for Missing Responses in Item Response Theory Models

Peer reviewed

Direct link

Pohl, Steffi; Gräfe, Linda; Rose, Norman – Educational and Psychological Measurement, 2014

Data from competence tests usually show a number of missing responses on test items due to both omitted and not-reached items. Different approaches for dealing with missing responses exist, and there are no clear guidelines on which of those to use. While classical approaches rely on an ignorable missing data mechanism, the most recently developed…

Descriptors: Test Items, Achievement Tests, Item Response Theory, Models

Assessing Scientific Reasoning: A Comprehensive Evaluation of Item Features That Affect Item Difficulty

Peer reviewed

Direct link

Stiller, Jurik; Hartmann, Stefan; Mathesius, Sabrina; Straube, Philipp; Tiemann, Rüdiger; Nordmeier, Volkhard; Krüger, Dirk; Upmeier zu Belzen, Annette – Assessment & Evaluation in Higher Education, 2016

The aim of this study was to improve the criterion-related test score interpretation of a text-based assessment of scientific reasoning competencies in higher education by evaluating factors which systematically affect item difficulty. To provide evidence about the specific demands which test items of various difficulty make on pre-service…

Descriptors: Logical Thinking, Scientific Concepts, Difficulty Level, Test Items

Gender and Minority Achievement Gaps in Science in Eighth Grade: Item Analyses of Nationally Representative Data. Research Report. ETS RR-17-36

Peer reviewed
PDF on ERIC

Download full text

Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseoph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017

In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…

Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8

Previous Page | Next Page »

Pages: 1 | 2

Huang, Hung-Yu	2
Abayomi, Funmilayo R.	1
Abu-Ghazalah, Rashid M.	1
Alamri, Abeer A.	1
Arenson, Ethan A.	1
Ayanwale, Musa Adekunle	1
Baptiste Moreau-Pernet	1
Berger, Martijn P. F.	1
Blömeke, Sigrid	1
Bolt, Daniel M.	1
Braeken, Johan	1
Chun Wang	1
Dimitrov, Dimiter M.	1
Dogan, Nuri	1
Dubins, David N.	1
Fifield, Steve	1
Ford, Danielle	1
Glutting, Joseoph	1
Greg Thompson	1
Gräfe, Linda	1
Hambleton, Ronald K.	1
Hannah Horne-Robinson	1
Hartmann, Stefan	1
Isaac-Oloniyo, Flourish O.	1
Jing Lu	1
More ▼