ERIC - Search Results

Publication Date

In 2025	2
Since 2024	8
Since 2021 (last 5 years)	32
Since 2016 (last 10 years)	65
Since 2006 (last 20 years)	110

Descriptor

Item Analysis	110
Scoring	110
Test Items	55
Item Response Theory	37
Test Construction	29
Test Validity	27
Foreign Countries	26
Comparative Analysis	24
Psychometrics	23
Test Reliability	23
Scores	19
Computer Assisted Testing	17
Student Evaluation	16
Correlation	15
Second Language Learning	15
English (Second Language)	14
Testing	14
Accuracy	13
Achievement Tests	13
Language Tests	13
Teaching Methods	13
Difficulty Level	12
Scaling	11
Decision Making	10
Mathematics Tests	10
More ▼

Publication Type

Journal Articles	88
Reports - Research	62
Reports - Evaluative	30
Reports - Descriptive	13
Numerical/Quantitative Data	9
Tests/Questionnaires	8
Dissertations/Theses -…	4
Collected Works - General	2
Speeches/Meeting Papers	2
Books	1
Guides - Non-Classroom	1
Information Analyses	1
More ▼

Education Level

Secondary Education	22
Elementary Education	19
Higher Education	17
Elementary Secondary Education	13
Postsecondary Education	12
High Schools	9
Middle Schools	9
Early Childhood Education	8
Grade 6	8
Grade 8	8
Junior High Schools	8
Grade 5	7
Grade 7	7
Primary Education	7
Grade 4	6
Intermediate Grades	6
Grade 3	5
Grade 9	4
Grade 10	3
Grade 11	3
Kindergarten	3
Grade 12	1
Preschool Education	1
More ▼

Audience

Researchers

Location

China	3
Australia	2
Europe	2
Indonesia	2
Iran	2
Netherlands	2
Oregon	2
United Kingdom	2
Asia	1
Czech Republic	1
Florida	1
Hong Kong	1
Idaho	1
Illinois	1
Latin America	1
Maryland	1
Massachusetts	1
New York	1
New Zealand	1
North Carolina (Greensboro)	1
Norway	1
Pennsylvania	1
Poland	1
Puerto Rico	1
Texas	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Individuals with Disabilities…	3

Assessments and Surveys

Program for International…	4
National Assessment of…	2
Peabody Picture Vocabulary…	2
Test of English as a Foreign…	2
Trends in International…	2
Cornell Critical Thinking Test	1
Graduate Record Examinations	1
Kaufman Assessment Battery…	1
Kaufman Test of Educational…	1
Remote Associates Test	1
Wechsler Adult Intelligence…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
Wechsler Preschool and…	1
Woodcock Johnson Tests of…	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 110 results Save | Export

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

An Approach to Test Equating under the Latent "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021

This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…

Descriptors: Equated Scores, Scoring, Test Items, Accuracy

Using Nominal Models to Examine How High School Students Use an I Do Not Know Response Option When Answering Scale Items

Direct link

Laura Laclede – ProQuest LLC, 2023

Because non-cognitive constructs can influence student success in education beyond academic achievement, it is essential that they are reliably conceptualized and measured. Within this context, there are several gaps in the literature related to correctly interpreting the meaning of scale scores when a non-standard response option like I do not…

Descriptors: High School Students, Test Wiseness, Models, Test Items

Evaluating ChatGPT as a Self-Learning Tool in Medical Biochemistry: A Performance Assessment in Undergraduate Medical University Examination

Peer reviewed

Direct link

Krishna Mohan Surapaneni; Anusha Rajajagadeesan; Lakshmi Goudhaman; Shalini Lakshmanan; Saranya Sundaramoorthi; Dineshkumar Ravi; Kalaiselvi Rajendiran; Porchelvan Swaminathan – Biochemistry and Molecular Biology Education, 2024

The emergence of ChatGPT as one of the most advanced chatbots and its ability to generate diverse data has given room for numerous discussions worldwide regarding its utility, particularly in advancing medical education and research. This study seeks to assess the performance of ChatGPT in medical biochemistry to evaluate its potential as an…

Descriptors: Biochemistry, Science Instruction, Artificial Intelligence, Teaching Methods

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Direct link

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Direct link

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Exploring Speededness in Pre-Reform GCSEs (2009 to 2016)

Download full text

Direct link

Emma Walland – Research Matters, 2024

GCSE examinations (taken by students aged 16 years in England) are not intended to be speeded (i.e. to be partly a test of how quickly students can answer questions). However, there has been little research exploring this. The aim of this research was to explore the speededness of past GCSE written examinations, using only the data from scored…

Descriptors: Educational Change, Test Items, Item Analysis, Scoring

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Latent "D"-Scoring Modeling: Estimation of Item and Person Parameters

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2021

This study presents a latent (item response theory--like) framework of a recently developed classical approach to test scoring, equating, and item analysis, referred to as "D"-scoring method. Specifically, (a) person and item parameters are estimated under an item response function model on the "D"-scale (from 0 to 1) using…

Descriptors: Scoring, Equated Scores, Item Analysis, Item Response Theory

Polytomous Testlet Response Models for Technology-Enhanced Innovative Items: Implications on Model Fit and Trait Inference

Peer reviewed

Direct link

Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022

The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…

Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring

A Comparison between the Use of Latent D-Scoring Method Models and Item Response Theory Models with Respect to Item Fit and Person Recovery Parameter

Direct link

Mohammed Alqabbaa – ProQuest LLC, 2021

Psychometricians at an organization named the Education and Training Evaluation Commission (ETEC) developed a new test scoring method called the latent D-scoring method (DSM-L) where it is believed that the new method itself is much easier and more efficient to use compared to the Item Response Theory (IRT) method. However, there are no studies…

Descriptors: Item Response Theory, Scoring, Item Analysis, Equated Scores

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Impacts of Scoring Methods on Multiple-Select Multiple-Choice Item Statistics

Direct link

Alicia A. Stoltenberg – ProQuest LLC, 2024

Multiple-select multiple-choice items, or multiple-choice items with more than one correct answer, are used to quickly assess content on standardized assessments. Because there are multiple keys to these item types, there are also multiple ways to score student responses to these items. The purpose of this study was to investigate how changing the…

Descriptors: Scoring, Evaluation Methods, Multiple Choice Tests, Standardized Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Journal of Psychoeducational…	7
Language Testing	6
ETS Research Report Series	5
Educational and Psychological…	4
ProQuest LLC	4
International Association for…	3
International Journal of…	3
Journal of Technology,…	3
Measurement:…	3
Partnership for Assessment of…	3
Assessment for Effective…	2
Canadian Journal of School…	2
Journal of Educational…	2
Journal of Educational and…	2
Journal of Experimental…	2
Journal of Speech, Language,…	2
Ministerial Council on…	2
New Meridian Corporation	2
AERA Online Paper Repository	1
Action in Teacher Education	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment	1
Biochemistry and Molecular…	1
Bioscience Education e-Journal	1
More ▼

Dimitrov, Dimiter M.	5
Atanasov, Dimitar V.	3
Bowles, Ryan P.	2
Chernyshenko, Oleksandr S.	2
Donovan, Jenny	2
Forthmann, Boris	2
Foster, Tricia D.	2
Guo, Hongwen	2
Hutton, Penny	2
Justice, Laura M.	2
Khan, Kiren S.	2
Lennon, Melissa	2
Martin, Michael O., Ed.	2
Mullis, Ina V. S., Ed.	2
Piasta, Shayne B.	2
Skibbe, Lori E.	2
von Davier, Matthias	2
Adame, Cindy	1
Ahmed, S.	1
Alicia A. Stoltenberg	1
Allan S. Cohen	1
Allee-Smith, Paula J.	1
Alpizar, David	1
Anusha Rajajagadeesan	1
Attali, Yigal	1
More ▼