ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	10

Descriptor

Computer Software	16
Goodness of Fit	16
Test Items	16
Item Response Theory	7
Item Analysis	5
Models	5
Psychometrics	4
Statistical Analysis	4
Accuracy	3
Classification	3
Correlation	3
Error of Measurement	3
Mathematical Models	3
Measurement Techniques	3
Scores	3
Scoring	3
Simulation	3
Test Reliability	3
Testing	3
Comparative Analysis	2
Computation	2
Construct Validity	2
Educational Assessment	2
Equated Scores	2
Factor Analysis	2
More ▼

Source

Cogent Education	1
Eurasian Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Language Assessment Quarterly	1
Language Testing	1
Measurement:…	1
ProQuest LLC	1
Routledge, Taylor & Francis…	1

Publication Type

Reports - Research	10
Journal Articles	8
Speeches/Meeting Papers	3
Reports - Descriptive	2
Reports - Evaluative	2
Books	1
Dissertations/Theses -…	1

Education Level

Higher Education	2
Postsecondary Education	1
Secondary Education	1

Audience

Researchers	3
Policymakers	1
Practitioners	1
Students	1

Location

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Graduate Management Admission…	1
Peabody Picture Vocabulary…	1
Program for International…	1
Students Evaluation of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

A Comprehensive Review of Rasch Measurement in Language Assessment: Recommendations and Guidelines for Research

Peer reviewed

Direct link

Aryadoust, Vahid; Ng, Li Ying; Sayama, Hiroki – Language Testing, 2021

Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied linguistics journals for multiple features. We found that seven Rasch models and 23 software packages were adopted in these papers, with many-facet…

Descriptors: Language Tests, Testing, Test Items, Network Analysis

Implementation of Cognitive Diagnosis Modeling Using the GDINA R Package

Peer reviewed
PDF on ERIC

Download full text

Torre, Jimmy de la; Akbay, Lokman – Eurasian Journal of Educational Research, 2019

Purpose: Well-designed assessment methodologies and various cognitive diagnosis models (CDMs) to extract diagnostic information about examinees' individual strengths and weaknesses have been developed. Due to this novelty, as well as educational specialists' lack of familiarity with CDMs, their applications are not widespread. This article aims at…

Descriptors: Cognitive Measurement, Models, Computer Software, Testing

Monte Carlo Simulation in Item Response Theory Applications Using SAS

Peer reviewed

Direct link

Ames, Allison J.; Leventhal, Brian C.; Ezike, Nnamdi C. – Measurement: Interdisciplinary Research and Perspectives, 2020

Data simulation and Monte Carlo simulation studies are important skills for researchers and practitioners of educational and psychological measurement, but there are few resources on the topic specific to item response theory. Even fewer resources exist on the statistical software techniques to implement simulation studies. This article presents…

Descriptors: Monte Carlo Methods, Item Response Theory, Simulation, Computer Software

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Direct link

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

Item Response Data Analysis Using Stata Item Response Theory Package

Peer reviewed

Direct link

Yang, Ji Seung; Zheng, Xiaying – Journal of Educational and Behavioral Statistics, 2018

The purpose of this article is to introduce and review the capability and performance of the Stata item response theory (IRT) package that is available from Stata v.14, 2015. Using a simulated data set and a publicly available item response data set extracted from Programme of International Student Assessment, we review the IRT package from…

Descriptors: Item Response Theory, Item Analysis, Computer Software, Statistical Analysis

Rasch Analysis: A Primer for School Psychology Researchers and Practitioners

Peer reviewed

Direct link

Boone, William J.; Noltemeyer, Amity – Cogent Education, 2017

In order to progress as a field, school psychology research must be informed by effective measurement techniques. One approach to address the need for careful measurement is Rasch analysis. This technique can (a) facilitate the development of instruments that provide useful data, (b) provide data that can be used confidently for both descriptive…

Descriptors: Item Response Theory, School Psychology, School Psychologists, Educational Research

Evaluating IRT- and CTT-Based Methods of Estimating Classification Consistency and Accuracy Indices from Single Administrations

Direct link

Deng, Nina – ProQuest LLC, 2011

Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, LEE method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…

Descriptors: Item Response Theory, Test Theory, Computation, Classification

Construct Validity and Measurement Invariance of the Peabody Picture Vocabulary Test-III Form A

Peer reviewed

Direct link

Pae, Hye K.; Greenberg, Daphne; Morris, Robin D. – Language Assessment Quarterly, 2012

The aim of this study was to apply the Rasch model to an analysis of the psychometric properties of the Peabody Picture Vocabulary Test--III Form A (PPVT--IIIA) items with struggling adult readers. The PPVT--IIIA was administered to 229 African American adults whose isolated word reading skills were between third and fifth grades. Conformity of…

Descriptors: African Americans, Test Items, Construct Validity, Test Validity

Handbook of Polytomous Item Response Theory Models

Direct link

Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010

This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…

Descriptors: Guides, Item Response Theory, Test Items, Correlation

A Primer on Logistic Regression.

Download full text

Woldbeck, Tanya – 1998

This paper introduces logistic regression as a viable alternative when the researcher is faced with variables that are not continuous. If one is to use simple regression, the dependent variable must be measured on a continuous scale. In the behavioral sciences, it may not always be appropriate or possible to have a measured dependent variable on a…

Descriptors: Behavioral Science Research, Chi Square, Computer Software, Goodness of Fit

Rasch Analysis Using SPSS.

Phillips, Gary W. – 1983

Ways in which the Statistical Package for the Social Sciences (SPSS) can be used to perform some Rasch analyses are described in detail. It is shown how SPSS and a set of item calibrations can be used to estimate person abilities, standard errors of measurement, test characteristic curve, test information curve, classification consistency on a…

Descriptors: Classification, Computer Software, Error of Measurement, Estimation (Mathematics)

The Relationship of Constrained Free-Response to Multiple-Choice and Open-Ended Items.

Download full text

Bennett, Randy Elliot; And Others – 1989

This study examined the relationship of a machine-scorable, constrained free-response computer science item that required the student to debug a faulty program to two other types of items: multiple-choice and free-response requiring production of a computer program. The free-response items were from the College Board's Advanced Placement Computer…

Descriptors: College Students, Computer Science, Computer Software, Debugging (Computers)

A New, More Powerful Approach to Multitrait-Multimethod Analyses: An Application of Second-Order Confirmatory Factor Analysis.

Download full text

Marsh, Herbert W.; Hocevar, Dennis – 1986

The advantages of applying confirmatory factor analysis (CFA) to multitrait-multimethod (MTMM) data are widely recognized. However, because CFA as traditionally applied to MTMM data incorporates single indicators of each scale (i.e., each trait/method combination), important weaknesses are the failure to: (1) correct appropriately for measurement…

Descriptors: Computer Software, Construct Validity, Correlation, Error of Measurement

An Exploratory Study of the Applicability of Item Response Theory Methods to the Graduate Management Admission Test.

Download full text

Kingston, Neal; And Others – 1985

A necessary prerequisite to the operational use of item response theory (IRT) in any testing program is the investigation of the feasibility of such an approach. This report presents the results of such research for the Graduate Management Admission Test (GMAT). Despite the fact that GMAT data appear to violate a basic assumption of the…

Descriptors: College Entrance Examinations, Computer Software, Correlation, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2

Akbay, Lokman	1
Ames, Allison J.	1
Aryadoust, Vahid	1
Baghaei, Purya	1
Bennett, Randy Elliot	1
Boone, William J.	1
Deng, Nina	1
Ezike, Nnamdi C.	1
Greenberg, Daphne	1
Hocevar, Dennis	1
Kingston, Neal	1
Kuan-Yu Jin	1
Lang, William Steve	1
Leventhal, Brian C.	1
Marsh, Herbert W.	1
Morris, Robin D.	1
Nering, Michael L., Ed.	1
Ng, Li Ying	1
Noltemeyer, Amity	1
Ostini, Remo, Ed.	1
Pae, Hye K.	1
Phillips, Gary W.	1
Ravand, Hamdollah	1
Sayama, Hiroki	1
More ▼