ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	15

Descriptor

Item Response Theory	15
Programming Languages	15
Item Analysis	6
Test Items	6
Computer Software	4
Models	4
Accuracy	3
Classification	3
Correlation	3
Foreign Countries	3
Longitudinal Studies	3
Monte Carlo Methods	3
Psychometrics	3
Simulation	3
Artificial Intelligence	2
Bayesian Statistics	2
Computer Science Education	2
Difficulty Level	2
Evaluation Methods	2
Geometry	2
Outcomes of Education	2
Questionnaires	2
Sample Size	2
Scaling	2
Scores	2
More ▼

Source

Measurement:…	3
Grantee Submission	2
International Journal of…	2
Computer Science Education	1
Education Sciences	1
InTech	1
Interchange: A Quarterly…	1
International Educational…	1
Journal of Intelligence	1
Journal on Efficiency and…	1
Large-scale Assessments in…	1
More ▼

Publication Type

Reports - Research	13
Journal Articles	11
Books	1
Collected Works - General	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1
Elementary Secondary Education	1

Audience

Location

Nigeria	2
Germany	1

Laws, Policies, & Programs

Assessments and Surveys

National Education…	1
Raven Progressive Matrices	1
Rosenberg Self Esteem Scale	1
Test of English for…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

A Validation Study of the Extended Relevance Scale Using the D3mirt Package for R

Peer reviewed

Direct link

Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025

This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…

Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism

A Note on Improving Variational Estimation for Multidimensional Item Response Theory

Peer reviewed

Direct link

Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…

Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy

NEPSscaling: Plausible Value Estimation for Competence Tests Administered in the German National Educational Panel Study

Peer reviewed

Direct link

Scharl, Anna; Zink, Eva – Large-scale Assessments in Education, 2022

Educational large-scale assessments (LSAs) often provide plausible values for the administered competence tests to facilitate the estimation of population effects. This requires the specification of a background model that is appropriate for the specific research question. Because the "German National Educational Panel Study" (NEPS) is…

Descriptors: National Competency Tests, Foreign Countries, Programming Languages, Longitudinal Studies

To What Extent Are Item Discrimination Values Realistic? A New Index for Two-Dimensional Structures

Peer reviewed
PDF on ERIC

Download full text

Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022

Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…

Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items

Diagnosing a 12-Item Dataset of Raven Matrices: With Dexter

Peer reviewed
PDF on ERIC

Download full text

Partchev, Ivailo – Journal of Intelligence, 2020

We analyze a 12-item version of Raven's Standard Progressive Matrices test, traditionally scored with the sum score. We discuss some important differences between assessment in practice and psychometric modelling. We demonstrate some advanced diagnostic tools in the freely available R package, dexter. We find that the first item in the test…

Descriptors: Intelligence Tests, Scores, Psychometrics, Diagnostic Tests

Multidimensional Item Response Theory Calibration of Dichotomous Response Structure Using R Language for Statistical Computing

Peer reviewed

Direct link

Musa Adekunle Ayanwale; Jamiu Oluwadamilare Amusa; Adekunle Ibrahim Oladejo; Funmilayo Ayedun – Interchange: A Quarterly Review of Education, 2024

The study focuses on assessing the proficiency levels of higher education students, specifically the physics achievement test (PHY 101) at the National Open University of Nigeria (NOUN). This test, like others, evaluates various aspects of knowledge and skills simultaneously. However, relying on traditional models for such tests can result in…

Descriptors: Item Response Theory, Difficulty Level, Item Analysis, Test Items

Investigation of the Effect of Parameter Estimation and Classification Accuracy in Mixture IRT Models under Different Conditions

Peer reviewed
PDF on ERIC

Download full text

Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022

This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…

Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages

Measurement Bias and Error Correction in a Two-Stage Estimation for Multilevel IRT Models

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue Zhang; Chun Wang – Grantee Submission, 2021

Among current state-of-art estimation methods for multilevel IRT models, the two-stage divide-and-conquer strategy has practical advantages, such as clearer definition of factors, convenience for secondary data analysis, convenience for model calibration and fit evaluation, and avoidance of improper solutions. However, various studies have shown…

Descriptors: Error of Measurement, Error Correction, Item Response Theory, Comparative Analysis

Ensuring Scalability of a Cognitive Multiple-Choice Test through the Mokken Package in R Programming Language

Peer reviewed
PDF on ERIC

Download full text

Ayanwale, Musa Adekunle; Ndlovu, Mdutshekelwa – Education Sciences, 2021

This study investigated the scalability of a cognitive multiple-choice test through the Mokken package in the R programming language for statistical computing. A 2019 mathematics West African Examinations Council (WAEC) instrument was used to gather data from randomly drawn K-12 participants (N = 2866; Male = 1232; Female = 1634; Mean age = 16.5…

Descriptors: Cognitive Tests, Multiple Choice Tests, Scaling, Test Items

Using the eRm Package for Rasch Modeling

Peer reviewed

Direct link

Padgett, R. Noah; Morgan, Grant B. – Measurement: Interdisciplinary Research and Perspectives, 2020

The "extended Rasch modeling" (eRm) package in R provides users with a comprehensive set of tools for Rasch modeling for scale evaluation and general modeling. We provide a brief introduction to Rasch modeling followed by a review of literature that utilizes the eRm package. Then, the key features of the eRm package for scale evaluation…

Descriptors: Computer Software, Programming Languages, Self Esteem, Self Concept Measures

No Meaning Left Unlearned: Predicting Learners' Knowledge of Atypical Meanings of Words from Vocabulary Tests for Their Typical Meanings

Peer reviewed
PDF on ERIC

Download full text

Ehara, Yo – International Educational Data Mining Society, 2022

Language learners are underserved if there are unlearned meanings of a word that they think they have already learned. For example, "circle" as a noun is well known, whereas its use as a verb is not. For artificial-intelligence-based support systems for learning vocabulary, assessing each learner's knowledge of such atypical but common…

Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Second Language Instruction

Using Stan for Item Response Theory Models

Peer reviewed

Direct link

Ames, Allison J.; Au, Chi Hang – Measurement: Interdisciplinary Research and Perspectives, 2018

Stan is a flexible probabilistic programming language providing full Bayesian inference through Hamiltonian Monte Carlo algorithms. The benefits of Hamiltonian Monte Carlo include improved efficiency and faster inference, when compared to other MCMC software implementations. Users can interface with Stan through a variety of computing…

Descriptors: Item Response Theory, Computer Software Evaluation, Computer Software, Programming Languages

The Coding Stages Assessment: Development and Validation of an Instrument for Assessing Young Children's Proficiency in the Scratchjr Programming Language

Peer reviewed

Direct link

de Ruiter, Laura E.; Bers, Marina U. – Computer Science Education, 2022

Background and Context: Despite the increasing implementation of coding in early curricula, there are few valid and reliable assessments of coding abilities for young children. This impedes studying learning outcomes and the development and evaluation of curricula. Objective: Developing and validating a new instrument for assessing young…

Descriptors: Programming Languages, Computer Software, Coding, Computer Science Education

Bayesian Diagnostics for Test Design and Analysis

Peer reviewed
PDF on ERIC

Download full text

Silva, R. M.; Guan, Y.; Swartz, T. B. – Journal on Efficiency and Responsibility in Education and Science, 2017

This paper attempts to bridge the gap between classical test theory and item response theory. It is demonstrated that the familiar and popular statistics used in classical test theory can be translated into a Bayesian framework where all of the advantages of the Bayesian paradigm can be realized. In particular, prior opinion can be introduced and…

Descriptors: Item Response Theory, Bayesian Statistics, Test Construction, Markov Processes

Advances in Learning Processes

Direct link

Rosson, Mary Beth, Ed. – InTech, 2010

Readers will find several papers that address high-level issues in the use of technology in education, for example architecture and design frameworks for building online education materials or tools. Several other chapters report novel approaches to intelligent tutors or adaptive systems in educational settings. A number of chapters consider many…

Descriptors: Educational Technology, Student Projects, Active Learning, Information Systems

Chun Wang	2
Adekunle Ibrahim Oladejo	1
Ames, Allison J.	1
Anders Sjöberg	1
Atar, Hakan Yavuz	1
Au, Chi Hang	1
Ayanwale, Musa Adekunle	1
Bers, Marina U.	1
Chenchen Ma	1
Ehara, Yo	1
Erik Forsberg	1
Funmilayo Ayedun	1
Gongjun Xu	1
Guan, Y.	1
Jamiu Oluwadamilare Amusa	1
Jing Ouyang	1
Kilic, Abdullah Faruk	1
Morgan, Grant B.	1
Musa Adekunle Ayanwale	1
Ndlovu, Mdutshekelwa	1
Padgett, R. Noah	1
Partchev, Ivailo	1
Rosson, Mary Beth, Ed.	1
Saatcioglu, Fatima Munevver	1
Scharl, Anna	1
More ▼