Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 31 |
| Since 2007 (last 20 years) | 39 |
Descriptor
Source
Author
Publication Type
| Journal Articles | 26 |
| Reports - Research | 25 |
| Dissertations/Theses -… | 6 |
| Reports - Descriptive | 4 |
| Reports - Evaluative | 4 |
| Speeches/Meeting Papers | 3 |
| Non-Print Media | 2 |
| Reference Materials - General | 2 |
| Tests/Questionnaires | 2 |
Education Level
| Elementary Secondary Education | 13 |
| Elementary Education | 12 |
| Secondary Education | 12 |
| Higher Education | 8 |
| Junior High Schools | 8 |
| Middle Schools | 8 |
| Postsecondary Education | 7 |
| Grade 8 | 5 |
| Grade 4 | 3 |
| Grade 7 | 3 |
| High Schools | 3 |
| More ▼ | |
Audience
| Policymakers | 2 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Singapore | 2 |
| Turkey | 2 |
| Asia | 1 |
| Austria | 1 |
| Germany | 1 |
| Hong Kong | 1 |
| Japan | 1 |
| Massachusetts | 1 |
| South Korea | 1 |
| Taiwan | 1 |
| Thailand | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Trends in International… | 12 |
| Advanced Placement… | 2 |
| National Assessment of… | 2 |
| Big Five Inventory | 1 |
| Florida Comprehensive… | 1 |
| Force Concept Inventory | 1 |
| Program for International… | 1 |
What Works Clearinghouse Rating
Su, Kun; Henson, Robert A. – Journal of Educational and Behavioral Statistics, 2023
This article provides a process to carefully evaluate the suitability of a content domain for which diagnostic classification models (DCMs) could be applicable and then optimized steps for constructing a test blueprint for applying DCMs and a real-life example illustrating this process. The content domains were carefully evaluated using a set of…
Descriptors: Classification, Models, Science Tests, Physics
Andrew M. Olney – Grantee Submission, 2023
Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully-controlled…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms
Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024
A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…
Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models
Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023
Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…
Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation
Hansen, John; Stewart, John – Physical Review Physics Education Research, 2021
This work is the fourth of a series of papers applying multidimensional item response theory (MIRT) to widely used physics conceptual assessments. This study applies MIRT analysis using both exploratory and confirmatory methods to the Brief Electricity and Magnetism Assessment (BEMA) to explore the assessment's structure and to determine a…
Descriptors: Item Response Theory, Science Tests, Energy, Magnets
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021
In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…
Descriptors: Perception, Bias, Theories, Test Items
Schmid, Kelly M.; Lee, Dennis; Weindling, Monica; Syed, Awais; Agyemang, Stephanie-Louise Yacoba; Donovan, Brian; Radick, Gregory; Smith, Michelle K. – Journal of Microbiology & Biology Education, 2022
Undergraduate genetics courses have historically focused on simple genetic models, rather than taking a more multifactorial approach where students explore how traits are influenced by a combination of genes, the environment, and gene-by-environment interactions. While a focus on simple genetic models can provide straightforward examples to…
Descriptors: Undergraduate Students, Genetics, Science Instruction, Models
Laliyo, Lukman Abdul Rauf; Hamdi, Syukrul; Pikoli, Masrid; Abdullah, Romario; Panigoro, Citra – European Journal of Educational Research, 2021
One of the issues that hinder the students' learning progress is the inability to construct an epistemological explanation of a scientific phenomenon. Four-tier multiple-choice (hereinafter, 4TMC) instrument and Partial-Credit Model were employed to elaborate on the diagnosis process of the aforementioned problem. This study was to develop and…
Descriptors: Learning Processes, Multiple Choice Tests, Models, Test Items
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Wu, Qian; De Laet, Tinne; Janssen, Rianne – Journal of Educational Measurement, 2019
Single-best answers to multiple-choice items are commonly dichotomized into correct and incorrect responses, and modeled using either a dichotomous item response theory (IRT) model or a polytomous one if differences among all response options are to be retained. The current study presents an alternative IRT-based modeling approach to…
Descriptors: Multiple Choice Tests, Item Response Theory, Test Items, Responses
Nixi Wang – ProQuest LLC, 2022
Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…
Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity
Intasoi, Sasima; Junpeng, Putcharee; Tang, Keow Ngang; Ketchatturat, Jatuphum; Zhang, Yidan; Wilson, Mark – International Journal of Evaluation and Research in Education, 2020
The study aimed to develop and validate an assessment framework of multidimensional scientific competencies for seventh-grade students in the northeastern region of Thailand. A total of 289 samples with three different scientific competency levels were randomly selected to participate as test-takers. The design-based research encompassing four…
Descriptors: Science Tests, Grade 7, Foreign Countries, Science Process Skills
Lin, Jing-Wen; Yu, Ruan-Ching – Asia Pacific Journal of Education, 2022
Modelling ability is one of the essential elements of the latest educational reforms, and Trends in International Mathematics and Science Study (TIMSS) is a curriculum-based assessment which allows educational systems worldwide to inspect the curricular influences. The aims of this study were to examine the role of modelling ability in the…
Descriptors: Grade 8, Educational Change, Cross Cultural Studies, Test Items
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Peer reviewed
Direct link
