ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	31
Since 2007 (last 20 years)	39

Descriptor

Models	41
Science Tests	41
Test Items	41
Foreign Countries	19
Item Response Theory	18
Achievement Tests	15
Mathematics Tests	14
Difficulty Level	13
International Assessment	13
Science Achievement	13
Elementary Secondary Education	12
Mathematics Achievement	12
Item Analysis	9
Multiple Choice Tests	7
Comparative Analysis	6
Science Education	6
Statistical Analysis	6
Elementary School Science	5
Elementary School Students	5
Physics	5
Psychometrics	5
Science Instruction	5
Scientific Concepts	5
Test Validity	5
Validity	5
More ▼

Publication Type

Journal Articles	26
Reports - Research	25
Dissertations/Theses -…	6
Reports - Descriptive	4
Reports - Evaluative	4
Speeches/Meeting Papers	3
Non-Print Media	2
Reference Materials - General	2
Tests/Questionnaires	2

Education Level

Elementary Secondary Education	13
Elementary Education	12
Secondary Education	12
Higher Education	8
Junior High Schools	8
Middle Schools	8
Postsecondary Education	7
Grade 8	5
Grade 4	3
Grade 7	3
High Schools	3
Intermediate Grades	2
More ▼

Audience

Policymakers	2
Practitioners	1
Teachers	1

Location

Singapore	2
Turkey	2
Asia	1
Austria	1
Germany	1
Hong Kong	1
Japan	1
Massachusetts	1
South Korea	1
Taiwan	1
Thailand	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	12
Advanced Placement…	2
National Assessment of…	2
Big Five Inventory	1
Florida Comprehensive…	1
Force Concept Inventory	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 41 results Save | Export

Optimizing Diagnostic Classification Models Application Considering Real-Life Constraints

Peer reviewed

Direct link

Su, Kun; Henson, Robert A. – Journal of Educational and Behavioral Statistics, 2023

This article provides a process to carefully evaluate the suitability of a content domain for which diagnostic classification models (DCMs) could be applicable and then optimized steps for constructing a test blueprint for applying DCMs and a real-life example illustrating this process. The content domains were carefully evaluated using a set of…

Descriptors: Classification, Models, Science Tests, Physics

Generating Multiple Choice Questions from a Textbook: LLMs Match Human Performance on Most Metrics

Peer reviewed
PDF on ERIC

Download full text

Andrew M. Olney – Grantee Submission, 2023

Multiple choice questions are traditionally expensive to produce. Recent advances in large language models (LLMs) have led to fine-tuned LLMs that generate questions competitive with human-authored questions. However, the relative capabilities of ChatGPT-family models have not yet been established for this task. We present a carefully-controlled…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Algorithms

Fused SDT/IRT Models for Mixed-Format Exams

Peer reviewed

Direct link

Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024

A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…

Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models

Coding Energy Knowledge in Constructed Responses with Explainable NLP Models

Peer reviewed

Direct link

Gombert, Sebastian; Di Mitri, Daniele; Karademir, Onur; Kubsch, Marcus; Kolbe, Hannah; Tautz, Simon; Grimm, Adrian; Bohm, Isabell; Neumann, Knut; Drachsler, Hendrik – Journal of Computer Assisted Learning, 2023

Background: Formative assessments are needed to enable monitoring how student knowledge develops throughout a unit. Constructed response items which require learners to formulate their own free-text responses are well suited for testing their active knowledge. However, assessing such constructed responses in an automated fashion is a complex task…

Descriptors: Coding, Energy, Scientific Concepts, Formative Evaluation

Multidimensional Item Response Theory and the Brief Electricity and Magnetism Assessment

Peer reviewed

Direct link

Hansen, John; Stewart, John – Physical Review Physics Education Research, 2021

This work is the fourth of a series of papers applying multidimensional item response theory (MIRT) to widely used physics conceptual assessments. This study applies MIRT analysis using both exploratory and confirmatory methods to the Brief Electricity and Magnetism Assessment (BEMA) to explore the assessment's structure and to determine a…

Descriptors: Item Response Theory, Science Tests, Energy, Magnets

Investigating Item Complexity as a Source of Cross-National DIF in TIMSS Math and Science

Peer reviewed

Direct link

Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024

Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…

Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity

On Joining a Signal Detection Choice Model with Response Time Models

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2021

In a signal detection theory (SDT) approach to multiple choice exams, examinees are viewed as choosing, for each item, the alternative that is perceived as being the most plausible, with perceived plausibility depending in part on whether or not an item is known. The SDT model is a process model and provides measures of item difficulty, item…

Descriptors: Perception, Bias, Theories, Test Items

Mendelian or Multifactorial? Current Undergraduate Genetics Assessments Focus on Genes and Rarely Include the Environment

Peer reviewed
PDF on ERIC

Download full text

Schmid, Kelly M.; Lee, Dennis; Weindling, Monica; Syed, Awais; Agyemang, Stephanie-Louise Yacoba; Donovan, Brian; Radick, Gregory; Smith, Michelle K. – Journal of Microbiology & Biology Education, 2022

Undergraduate genetics courses have historically focused on simple genetic models, rather than taking a more multifactorial approach where students explore how traits are influenced by a combination of genes, the environment, and gene-by-environment interactions. While a focus on simple genetic models can provide straightforward examples to…

Descriptors: Undergraduate Students, Genetics, Science Instruction, Models

Implementation of Four-Tier Multiple-Choice Instruments Based on the Partial Credit Model in Evaluating Students' Learning Progress

Peer reviewed
PDF on ERIC

Download full text

Laliyo, Lukman Abdul Rauf; Hamdi, Syukrul; Pikoli, Masrid; Abdullah, Romario; Panigoro, Citra – European Journal of Educational Research, 2021

One of the issues that hinder the students' learning progress is the inability to construct an epistemological explanation of a scientific phenomenon. Four-tier multiple-choice (hereinafter, 4TMC) instrument and Partial-Credit Model were employed to elaborate on the diagnosis process of the aforementioned problem. This study was to develop and…

Descriptors: Learning Processes, Multiple Choice Tests, Models, Test Items

Variational Estimation for Multidimensional Generalized Partial Credit Model

Peer reviewed
PDF on ERIC

Download full text

Direct link

Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…

Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics

Modeling Partial Knowledge on Multiple-Choice Items Using Elimination Testing

Peer reviewed

Direct link

Wu, Qian; De Laet, Tinne; Janssen, Rianne – Journal of Educational Measurement, 2019

Single-best answers to multiple-choice items are commonly dichotomized into correct and incorrect responses, and modeled using either a dichotomous item response theory (IRT) model or a polytomous one if differences among all response options are to be retained. The current study presents an alternative IRT-based modeling approach to…

Descriptors: Multiple Choice Tests, Item Response Theory, Test Items, Responses

Toward Culturally Responsive and Equitable Testing: Innovative Psychometric Analyses on Contextualized Measurement and Adaptive Testing

Direct link

Nixi Wang – ProQuest LLC, 2022

Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…

Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity

Developing an Assessment Framework of Multidimensional Scientific Competencies

Peer reviewed
PDF on ERIC

Download full text

Intasoi, Sasima; Junpeng, Putcharee; Tang, Keow Ngang; Ketchatturat, Jatuphum; Zhang, Yidan; Wilson, Mark – International Journal of Evaluation and Research in Education, 2020

The study aimed to develop and validate an assessment framework of multidimensional scientific competencies for seventh-grade students in the northeastern region of Thailand. A total of 289 samples with three different scientific competency levels were randomly selected to participate as test-takers. The design-based research encompassing four…

Descriptors: Science Tests, Grade 7, Foreign Countries, Science Process Skills

Examining Modelling Ability before Educational Reforms: Findings of Cross-Country Comparisons of Grade 8 Data from TIMSS 2011

Peer reviewed

Direct link

Lin, Jing-Wen; Yu, Ruan-Ching – Asia Pacific Journal of Education, 2022

Modelling ability is one of the essential elements of the latest educational reforms, and Trends in International Mathematics and Science Study (TIMSS) is a curriculum-based assessment which allows educational systems worldwide to inspect the curricular influences. The aims of this study were to examine the role of modelling ability in the…

Descriptors: Grade 8, Educational Change, Cross Cultural Studies, Test Items

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Previous Page | Next Page »

Pages: 1 | 2 | 3

ProQuest LLC	6
Educational and Psychological…	4
Grantee Submission	3
College Board	2
Journal of Educational…	2
Large-scale Assessments in…	2
Applied Psychological…	1
Asia Pacific Journal of…	1
British Journal of…	1
Chemistry Education Research…	1
Contributions from Science…	1
Curriculum Journal	1
European Journal of…	1
International Electronic…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Computer Assisted…	1
Journal of Educational and…	1
Journal of Microbiology &…	1
Online Submission	1
Physical Review Physics…	1
More ▼

Engelhard, George, Jr.	2
Reshetar, Rosemary	2
Thummaphan, Phonraphee	2
Wind, Stefanie A.	2
Abdullah, Romario	1
Adadan, Emine	1
Agyemang, Stephanie-Louise…	1
Akaygun, Sevil	1
Andrew M. Olney	1
Arenson, Ethan A.	1
Baird, Jo-Anne	1
Beck, Christina	1
Ben-Eliyahu, Einat	1
Bock, H. Darrell	1
Bohm, Isabell	1
Bolt, Daniel M.	1
Chengyu Cui	1
Chiang, Jui-Ling	1
Chun Wang	1
Daniel M. Bolt	1
Darling, Andrew	1
De Laet, Tinne	1
DeCarlo, Lawrence T.	1
Di Mitri, Daniele	1
Dirlik, Ezgi Mor	1
More ▼