Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 33 |
Descriptor
Foreign Countries | 36 |
Item Response Theory | 36 |
Computer Software | 33 |
Models | 14 |
Test Items | 13 |
Achievement Tests | 11 |
Statistical Analysis | 11 |
Computation | 9 |
Computer Assisted Testing | 9 |
College Students | 7 |
International Assessment | 7 |
More ▼ |
Source
Author
Wang, Wen-Chung | 4 |
Huang, Hung-Yu | 2 |
Jin, Kuan-Yu | 2 |
Yang, Ji Seung | 2 |
Zheng, Xiaying | 2 |
Adams, Raymond J. | 1 |
Ahmed Al - Badri | 1 |
Barker, T. | 1 |
Basturk, Ramazan | 1 |
Bertsch, Andreas | 1 |
Blais, Jean-Guy | 1 |
More ▼ |
Publication Type
Journal Articles | 29 |
Reports - Research | 23 |
Reports - Descriptive | 5 |
Reports - Evaluative | 4 |
Collected Works - Proceedings | 3 |
Speeches/Meeting Papers | 3 |
Guides - Non-Classroom | 1 |
Education Level
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 7 |
Trends in International… | 3 |
MacArthur Communicative… | 1 |
Progress in International… | 1 |
Students Evaluation of… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021
The purpose of this paper is to briefly introduce two most common applications of multiple group item response theory (IRT) models, namely detecting differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…
Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis
Maeng, Seungho – Asia-Pacific Science Education, 2021
This study examined a case of GeoMapApp-based assessment to investigate a learning progression for middle school students' understanding of geoscience content and geocognition (spatial, temporal, and retrospective reasoning and system thinking). A 2-year GeoMapApp-based assessment process was administered along with a double-round of the construct…
Descriptors: Middle School Students, Learning Processes, Computer Software, Geology
Reichert, Frank; Zhang, Deju; Law, Nancy W. Y.; Wong, Gary K. W.; de la Torre, Jimmy – Educational Technology Research and Development, 2020
Digital literacy competence (DL) is an important capacity for students' learning in a rapidly changing world. However, little is known about the empirical structure of DL. In this paper, we review major DL assessment frameworks and explore the dimensionality of DL from an empirical perspective using assessment data collected using authentic…
Descriptors: Technological Literacy, Competence, Computer Software, Computer Oriented Programs
PaaBen, Benjamin; Bertsch, Andreas; Langer-Fischer, Katharina; Rüdian, Sylvio; Wang, Xia; Sinha, Rupali; Kuzilek, Jakub; Britsch, Stefan; Pinkwart, Niels – International Educational Data Mining Society, 2021
Many modern anatomy curricula teach histology using virtual microscopes, where students inspect tissue slices in a computer program (e.g. a web browser). However, the educational data mining (EDM) potential of these virtual microscopes remains under-utilized. In this paper, we use EDM techniques to investigate three research questions on a virtual…
Descriptors: Anatomy, Science Instruction, Computer Simulation, Computer Software
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
Vista, Alvin – European Journal of Educational Research, 2019
Cheating detection is an important issue in standardized testing, especially in large-scale settings. Statistical approaches are often computationally intensive and require specialised software to conduct. We present a two-stage approach that quickly filters suspected groups using statistical testing on an IRT-based answer-copying index. We also…
Descriptors: Cheating, Identification, Computer Software, Standardized Tests
Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019
Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…
Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures
Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018
The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…
Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis
Karlin, Omar; Karlin, Sayaka – InSight: A Journal of Scholarly Teaching, 2018
This study had two aims. The first was to explain the process of using the Rasch measurement model to validate tests in an easy-to-understand way for those unfamiliar with the Rasch measurement model. The second was to validate two final exams with several shared items. The exams were given to two groups of students with slightly differing English…
Descriptors: Item Response Theory, Test Validity, Test Items, Accuracy
O'Keeffe, Cormac – E-Learning and Digital Media, 2017
International Large Scale Assessments have been producing data about educational attainment for over 60 years. More recently however, these assessments as tests have become digitally and computationally complex and increasingly rely on the calculative work performed by algorithms. In this article I first consider the coordination of relations…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Wu, Mike; Davis, Richard L.; Domingue, Benjamin W.; Piech, Chris; Goodman, Noah – International Educational Data Mining Society, 2020
Item Response Theory (IRT) is a ubiquitous model for understanding humans based on their responses to questions, used in fields as diverse as education, medicine and psychology. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving test scoring and better informing public policy. Yet larger…
Descriptors: Item Response Theory, Accuracy, Data Analysis, Public Policy
Debeer, Dries; Janssen, Rianne; De Boeck, Paul – Journal of Educational Measurement, 2017
When dealing with missing responses, two types of omissions can be discerned: items can be skipped or not reached by the test taker. When the occurrence of these omissions is related to the proficiency process the missingness is nonignorable. The purpose of this article is to present a tree-based IRT framework for modeling responses and omissions…
Descriptors: Item Response Theory, Test Items, Responses, Testing Problems
Ravand, Hamdollah – Practical Assessment, Research & Evaluation, 2015
Cognitive diagnostic models (CDM) have been around for more than a decade but their application is far from widespread for mainly two reasons: (1) CDMs are novel, as compared to traditional IRT models. Consequently, many researchers lack familiarity with them and their properties, and (2) Software programs doing CDMs have been expensive and not…
Descriptors: Test Theory, Models, Computer Software, Open Source Technology
Kalkan, Ömür Kaya; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
Linear factor analysis models used to examine constructs underlying the responses are not very suitable for dichotomous or polytomous response formats. The associated problems cannot be eliminated by polychoric or tetrachoric correlations in place of the Pearson correlation. Therefore, we considered parameters obtained from the NOHARM and FACTOR…
Descriptors: Sample Size, Nonparametric Statistics, Factor Analysis, Correlation