Showing 1 to 15 of 64 results
Peer reviewed
Direct link
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
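For context, the parametric baseline in comparisons of this kind is typically Samejima's graded response model. As an illustrative formulation (not necessarily the exact parametric model used in the study), the probability that person i reaches at least category k on item j is

    P(X_{ij} \ge k \mid \theta_i) = \frac{1}{1 + \exp\{-a_j(\theta_i - b_{jk})\}},
    \qquad
    P(X_{ij} = k \mid \theta_i) = P(X_{ij} \ge k \mid \theta_i) - P(X_{ij} \ge k + 1 \mid \theta_i),

where \theta_i is the latent trait, a_j the item discrimination, and b_{jk} the ordered category thresholds. The nonparametric OS model replaces these fixed-form curves with functions estimated from the data.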
Peer reviewed
Direct link
Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023
This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…
Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis
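In Stata these fits are the irt 1pl, irt 2pl, and irt 3pl commands. As a language-neutral sketch of what the 1PL-versus-2PL comparison involves, the Python code below fits both models to simulated dichotomous responses by marginal maximum likelihood with Gauss-Hermite quadrature and runs the likelihood-ratio test of equal discriminations mentioned above; it is a minimal illustration, not a substitute for Stata's irt suite.

    import numpy as np
    from scipy.optimize import minimize
    from scipy.special import roots_hermitenorm
    from scipy.stats import chi2

    rng = np.random.default_rng(0)
    n_persons, n_items = 1000, 5

    # Simulate 2PL responses: P(correct) = logistic(a_j * (theta_i - b_j)).
    theta = rng.standard_normal(n_persons)
    a_true = rng.uniform(0.8, 2.0, n_items)
    b_true = rng.uniform(-1.5, 1.5, n_items)
    X = (rng.random((n_persons, n_items))
         < 1 / (1 + np.exp(-a_true * (theta[:, None] - b_true)))).astype(float)

    # Gauss-Hermite quadrature nodes/weights for a standard-normal trait.
    nodes, weights = roots_hermitenorm(21)
    weights = weights / weights.sum()

    def neg_loglik(params, one_pl):
        if one_pl:                        # 1PL: one common discrimination
            a, b = np.full(n_items, params[0]), params[1:]
        else:                             # 2PL: item-specific discriminations
            a, b = params[:n_items], params[n_items:]
        P = 1 / (1 + np.exp(-a * (nodes[:, None] - b)))        # (node, item)
        P = np.clip(P, 1e-12, 1 - 1e-12)
        log_pattern = X @ np.log(P).T + (1 - X) @ np.log(1 - P).T
        return -np.sum(np.log(np.exp(log_pattern) @ weights))  # marginal NLL

    fit_1pl = minimize(neg_loglik, np.r_[1.0, np.zeros(n_items)], args=(True,))
    fit_2pl = minimize(neg_loglik, np.r_[np.ones(n_items), np.zeros(n_items)],
                       args=(False,))

    # Likelihood-ratio test of equal discriminations: df = n_items - 1.
    lr = 2 * (fit_1pl.fun - fit_2pl.fun)
    print(f"LR = {lr:.2f}, p = {chi2.sf(lr, df=n_items - 1):.4f}")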
Laura Laclede – ProQuest LLC, 2023
Because non-cognitive constructs can influence student success in education beyond academic achievement, it is essential that they are reliably conceptualized and measured. Within this context, there are several gaps in the literature related to correctly interpreting the meaning of scale scores when a non-standard response option like I do not…
Descriptors: High School Students, Test Wiseness, Models, Test Items
Peer reviewed
Direct link
Ferrando, Pere J.; Navarro-González, David – Educational and Psychological Measurement, 2021
Item response theory "dual" models (DMs), in which both items and individuals are viewed as sources of differential measurement error, have so far been proposed only for unidimensional measures. This article proposes two multidimensional extensions of existing DMs: the M-DTCRM (dual Thurstonian continuous response model), intended for…
Descriptors: Item Response Theory, Error of Measurement, Models, Factor Analysis
Peer reviewed
PDF on ERIC
Carol Eckerly; Yue Jia; Paul Jewsbury – ETS Research Report Series, 2022
Testing programs have explored the use of technology-enhanced items alongside traditional item types (e.g., multiple-choice and constructed-response items) as measurement evidence of latent constructs modeled with item response theory (IRT). In this report, we discuss considerations in applying IRT models to a particular type of adaptive testlet…
Descriptors: Computer Assisted Testing, Test Items, Item Response Theory, Scoring
Peer reviewed
Direct link
Gao, Xuliang; Ma, Wenchao; Wang, Daxun; Cai, Yan; Tu, Dongbo – Journal of Educational and Behavioral Statistics, 2021
This article proposes a class of cognitive diagnosis models (CDMs) for polytomously scored items with different link functions. Many existing polytomous CDMs can be considered as special cases of the proposed class of polytomous CDMs. Simulation studies were carried out to investigate the feasibility of the proposed CDMs and the performance of…
Descriptors: Cognitive Measurement, Models, Test Items, Scoring
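In the generalized-linear-model sense, the "different link functions" are alternative transformations g(.) connecting a category-response probability to the effects of the mastered attributes. An illustrative cumulative formulation (generic, not the paper's exact notation) for item j, category k, and attribute profile alpha_i is

    g\{P(X_{ij} \ge k \mid \boldsymbol{\alpha}_i)\} = \lambda_{jk} + f_j(\boldsymbol{\alpha}_i),

with, for example, the logit link g(p) = \log\{p/(1-p)\}, the probit link g(p) = \Phi^{-1}(p), or the complementary log-log link g(p) = \log\{-\log(1-p)\}; here \lambda_{jk} is a category intercept and f_j(\cdot) collects the item's attribute effects.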
Peter Organisciak; Selcuk Acar; Denis Dumas; Kelly Berthiaume – Grantee Submission, 2023
Automated scoring for divergent thinking (DT) seeks to overcome a key obstacle to creativity measurement: the effort, cost, and reliability of scoring open-ended tests. For a common test of DT, the Alternate Uses Task (AUT), the primary automated approach casts the problem as a semantic distance between a prompt and the resulting idea in a text…
Descriptors: Automation, Computer Assisted Testing, Scoring, Creative Thinking
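The semantic-distance idea is compact enough to sketch: embed the prompt and the response with a pretrained text-embedding model and score originality as one minus their cosine similarity. In the sketch below, embed is a hypothetical stand-in for whatever embedding model is used; the study's actual models, corpora, and score scaling may differ.

    import numpy as np

    def cosine_distance(u: np.ndarray, v: np.ndarray) -> float:
        # 1 - cosine similarity: larger values mean the response sits
        # semantically farther from the prompt (i.e., is more original).
        return 1.0 - float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

    def originality(prompt: str, response: str, embed) -> float:
        # embed is any callable mapping text to a fixed-length vector,
        # e.g., averaged word vectors or a sentence encoder.
        return cosine_distance(embed(prompt), embed(response))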
Peer reviewed
PDF on ERIC
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
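As a minimal instance of the supervised setup described above, the sketch below trains a bag-of-words classifier on a few hypothetical scored responses. It stands in for the fine-tuned language models the paper discusses, which follow the same pattern of learning from human-provided score labels.

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    # Hypothetical labeled data: short answers with human-assigned 0/1 scores.
    answers = ["slope is rise over run",
               "you add the two numbers",
               "slope = (y2 - y1) / (x2 - x1)",
               "no idea"]
    scores = [1, 0, 1, 0]

    scorer = make_pipeline(TfidfVectorizer(), LogisticRegression())
    scorer.fit(answers, scores)
    print(scorer.predict(["slope means change in y over change in x"]))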
Peer reviewed
Direct link
Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023
The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…
Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items
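The construction rule, deleting the second half of every second word, is mechanical enough to sketch directly. The function below is a simplified illustration; operational C-tests add refinements that are omitted here, such as leaving the first sentence intact and special-casing punctuation and one-letter words.

    def make_ctest(paragraph: str) -> str:
        # Blank out the second half of every second word.
        words = paragraph.split()
        out = []
        for i, word in enumerate(words, start=1):
            if i % 2 == 0 and len(word) > 1:
                cut = len(word) // 2           # trailing characters to delete
                out.append(word[:len(word) - cut] + "_" * cut)
            else:
                out.append(word)
        return " ".join(out)

    print(make_ctest("The sun rises in the east and sets in the west"))
    # -> The su_ rises i_ the ea__ and se__ in th_ west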
Peer reviewed
Direct link
Nieto, Ricardo; Casabianca, Jodi M. – Journal of Educational Measurement, 2019
Many large-scale assessments are designed to yield two or more scores for an individual by administering multiple sections measuring different but related skills. Multidimensional tests, or more specifically simple-structure tests such as these, rely on multiple sections of multiple-choice and/or constructed-response items to generate multiple…
Descriptors: Tests, Scoring, Responses, Test Items
Peer reviewed
Direct link
Chung, Seungwon; Cai, Li – Journal of Educational and Behavioral Statistics, 2021
In the research reported here, we propose a new method for scale alignment and test scoring in the context of supporting students with disabilities. In educational assessment, students from these special populations take modified tests because of a demonstrated disability that requires more assistance than standard testing accommodation. Updated…
Descriptors: Students with Disabilities, Scoring, Achievement Tests, Test Items
Peer reviewed
Direct link
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We compare the classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
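A minimal convolutional classifier of the kind being compared can be sketched in PyTorch as follows; the architecture, input resolution, and number of score classes are illustrative assumptions, not the network the authors trained.

    import torch
    import torch.nn as nn

    class DrawingClassifier(nn.Module):
        # Tiny CNN: two conv/pool blocks, then a linear head over score classes.
        def __init__(self, n_classes: int = 3):    # e.g., incorrect/partial/correct
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
                nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            )
            self.head = nn.Linear(32 * 16 * 16, n_classes)

        def forward(self, x):                       # x: (batch, 1, 64, 64) grayscale
            return self.head(self.features(x).flatten(start_dim=1))

    logits = DrawingClassifier()(torch.randn(8, 1, 64, 64))
    print(logits.shape)                             # torch.Size([8, 3])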
Peer reviewed
PDF on ERIC
Warsono; Nursuhud, Puji Iman; Darma, Rio Sandhika; Supahar – International Journal of Instruction, 2020
This study analyzed test items on high school students' diagram-representation ability and obtained item characteristic curves. Test instrument grids were compiled based on competencies and indicators of diagram representation and were then used to construct the items. The test instrument consisted of five items and was validated by…
Descriptors: High School Students, Problem Solving, Visual Aids, Scoring
Peer reviewed
PDF on ERIC
van Rijn, Peter W.; Ali, Usama S. – ETS Research Report Series, 2018
A computer program was developed to estimate speed-accuracy response models for dichotomous items. This report describes how the models are estimated and how to specify data and input files. An example using data from a listening section of an international language test is described to illustrate the modeling approach and features of the computer…
Descriptors: Computer Software, Computation, Reaction Time, Timed Tests
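As general background (the report's exact parameterization may differ), speed-accuracy models of this kind typically pair an IRT model for the dichotomous response with a lognormal model for the response time, as in van der Linden's hierarchical framework:

    P(X_{ij} = 1 \mid \theta_i) = \frac{1}{1 + \exp\{-a_j(\theta_i - b_j)\}},
    \qquad
    \log T_{ij} \sim N(\beta_j - \tau_i, \; \sigma_j^2),

where \theta_i and \tau_i are test taker i's ability and speed, a_j and b_j are item j's discrimination and difficulty, and \beta_j and \sigma_j^2 are its time intensity and residual variance.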
Peer reviewed
PDF on ERIC
Loukina, Anastassia; Zechner, Klaus; Yoon, Su-Youn; Zhang, Mo; Tao, Jidong; Wang, Xinhao; Lee, Chong Min; Mulholland, Matthew – ETS Research Report Series, 2017
This report presents an overview of the "SpeechRater" automated scoring engine model building and evaluation process for several item types, with a focus on a low-English-proficiency test-taker population. We discuss each stage of speech scoring, including automatic speech recognition, filtering models for nonscorable responses, and…
Descriptors: Automation, Scoring, Speech Tests, Test Items