ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Computer Software	5
Foreign Countries	5
Item Response Theory	4
Test Items	4
Achievement Tests	3
Models	3
Accuracy	2
Bayesian Statistics	2
Computation	2
Correlation	2
International Assessment	2
Markov Processes	2
Mathematics Tests	2
Monte Carlo Methods	2
Responses	2
Artificial Intelligence	1
Automation	1
Beliefs	1
Classification	1
Cluster Grouping	1
Coding	1
College Entrance Examinations	1
College Faculty	1
College Students	1
Comparative Analysis	1
More ▼

Source

Educational and Psychological…

Author

Wang, Wen-Chung	2
Chen, Hui-Fang	1
Dimitrov, Dimiter M.	1
Goldhammer, Frank	1
Huang, Hung-Yu	1
Jin, Kuan-Yu	1
Khorramdel, Lale	1
Luo, Yong	1
Sälzer, Christine	1
Tyack, Lillian	1
Zehner, Fabian	1
von Davier, Matthias	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	5

Education Level

Elementary Education	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Elementary Secondary Education	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Germany	1
Hong Kong	1
Saudi Arabia	1
Taiwan	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
Program for International…	2
Students Evaluation of…	1

What Works Clearinghouse Rating

Showing all 5 results Save | Export

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

A Short Note on Obtaining Point Estimates of the IRT Ability Parameter with MCMC Estimation in Mplus: How Many Plausible Values Are Needed?

Peer reviewed

Direct link

Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019

Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…

Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures

Automatic Coding of Short Text Responses via Clustering in Educational Assessment

Peer reviewed

Direct link

Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016

Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…

Descriptors: Educational Assessment, Coding, Automation, Responses

Item Response Theory Models for Wording Effects in Mixed-Format Scales

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015

Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

Descriptors: Item Response Theory, Test Format, Language Usage, Test Items

Multilevel Higher-Order Item Response Theory Models

Peer reviewed

Direct link

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014

In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…

Descriptors: Item Response Theory, Hierarchical Linear Modeling, Computation, Test Reliability