ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	4
Since 2006 (last 20 years)	20

Descriptor

Models	24
Test Items	24
Test Theory	24
Difficulty Level	8
Comparative Analysis	7
Measurement Techniques	7
Evaluation Methods	6
Item Analysis	6
Item Response Theory	6
Psychometrics	6
Scoring	6
Testing	6
Evaluation Problems	5
Measurement	5
Classification	4
Definitions	4
Diagnostic Tests	4
Test Construction	4
Error of Measurement	3
Evaluation Criteria	3
Foreign Countries	3
Goodness of Fit	3
Mathematics Tests	3
Multiple Choice Tests	3
Responses	3
More ▼

Source

Measurement:…	5
Applied Psychological…	2
Educational and Psychological…	2
Journal of Educational and…	2
Asia Pacific Education Review	1
College Board	1
EURASIA Journal of…	1
Instructional Science	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Physical Review Physics…	1
ProQuest LLC	1
School Science and Mathematics	1
More ▼

Publication Type

Journal Articles	19
Reports - Research	12
Reports - Evaluative	5
Opinion Papers	4
Reports - Descriptive	3
Speeches/Meeting Papers	3
Dissertations/Theses -…	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	2
Grade 8	2
Secondary Education	2
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 7	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
More ▼

Audience

Practitioners

Location

United States	3
South Korea	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
Armed Services Vocational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Item Parameter Recovery via Traditional 2PL, Testlet and Bi-Factor Models for Testlet-Based Tests

Peer reviewed
PDF on ERIC

Download full text

Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2022

The testlet comprises a set of items based on a common stimulus. When the testlet is used in the tests, there may violate the local independence assumption, and in this case, it would not be appropriate to use traditional item response theory models in the tests in which the testlet is included. When the testlet is discussed, one of the most…

Descriptors: Test Items, Test Theory, Models, Sample Size

Classical Test Theory and Item Response Theory Comparison of the Brief Electricity and Magnetism Assessment and the Conceptual Survey of Electricity and Magnetism

Peer reviewed

Direct link

Eaton, Philip; Johnson, Keith; Barrett, Frank; Willoughby, Shannon – Physical Review Physics Education Research, 2019

For proper assessment selection understanding the statistical similarities amongst assessments that measure the same, or very similar, topics is imperative. This study seeks to extend the comparative analysis between the brief electricity and magnetism assessment (BEMA) and the conceptual survey of electricity and magnetism (CSEM) presented by…

Descriptors: Test Theory, Item Response Theory, Comparative Analysis, Energy

The Comparison of Item Parameters Estimated from Parametric and Nonparametric Item Response Theory Models in Case of the Violance of Local Independence Assumption

Peer reviewed
PDF on ERIC

Download full text

Dirlik, Ezgi Mor – International Journal of Progressive Education, 2019

Item response theory (IRT) has so many advantages than its precedent Classical Test Theory (CTT) such as non-changing item parameters, ability parameter estimations free from the items. However, in order to get these advantages, some assumptions should be met and they are; unidimensionality, normality and local independence. However, it is not…

Descriptors: Comparative Analysis, Nonparametric Statistics, Item Response Theory, Models

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Maximum Likelihood Item Easiness Models for Test Theory without an Answer Key

Peer reviewed

Direct link

France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015

Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…

Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory

What CDM Can Tell about What Students Have Learned: An Analysis of TIMSS Eighth Grade Mathematics

Peer reviewed

Direct link

Choi, Kyong Mi; Lee, Young-Sun; Park, Yoon Soo – EURASIA Journal of Mathematics, Science & Technology Education, 2015

International trended assessments have long attempted to provide instructional information to educational researchers and classroom teachers. Studies have shown that traditional methods of item analysis have not provided specific information that can be directly applicable to improve student performance. To this end, cognitive diagnosis models…

Descriptors: International Assessment, Mathematics Tests, Grade 8, Models

Relationships between Cognitive Diagnosis, CTT, and IRT Indices: An Empirical Investigation

Peer reviewed

Direct link

Lee, Young-Sun; de la Torre, Jimmy; Park, Yoon Soo – Asia Pacific Education Review, 2012

Cognitive diagnosis models (CDMs) continue to generate interest among researchers and practitioners because they can provide diagnostic information relevant to classroom instruction and student learning. However, its modeling component has outpaced its complementary component-test construction. Thus, most applications of cognitive diagnosis…

Descriptors: Cognitive Measurement, Models, Test Theory, Item Response Theory

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Direct link

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Rating Quality Studies Using Rasch Measurement Theory. Research Report 2013-3

Download full text

Engelhard, George, Jr.; Wind, Stefanie A. – College Board, 2013

The major purpose of this study is to examine the quality of ratings assigned to CR (constructed-response) questions in large-scale assessments from the perspective of Rasch Measurement Theory. Rasch Measurement Theory provides a framework for the examination of rating scale category structure that can yield useful information for interpreting the…

Descriptors: Measurement Techniques, Rating Scales, Test Theory, Scores

Impact of Psychometric Decisions on Assessment Outcomes in an Alternate Assessment

Direct link

Rao, Vasanthi – ProQuest LLC, 2012

In 1997, based on the amendments to Individuals with Disabilities Education Act (IDEA), all states were faced with a statutory requirement to develop and implement alternate assessments for students with disabilities unable to participate in the statewide large-scale assessment. States were given the challenge of creating, implementing, and…

Descriptors: Alternative Assessment, Psychometrics, Item Response Theory, Models

Conceptual Issues in Response-Time Modeling

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2009

Two different traditions of response-time (RT) modeling are reviewed: the tradition of distinct models for RTs and responses, and the tradition of model integration in which RTs are incorporated in response models or the other way around. Several conceptual issues underlying both traditions are made explicit and analyzed for their consequences. We…

Descriptors: Test Items, Models, Reaction Time, Measurement

Predictive Control of Speededness in Adaptive Testing

Peer reviewed

Direct link

van der Linden, Wim J. – Applied Psychological Measurement, 2009

An adaptive testing method is presented that controls the speededness of a test using predictions of the test takers' response times on the candidate items in the pool. Two different types of predictions are investigated: posterior predictions given the actual response times on the items already administered and posterior predictions that use the…

Descriptors: Simulation, Adaptive Testing, Vocational Aptitude, Bayesian Statistics

Some Notes on the Reinvention of Latent Structure Models as Diagnostic Classification Models

Peer reviewed

Direct link

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author points out few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…

Descriptors: Test Items, Probability, Models, Diagnostic Tests

Re-Examining Test Item Issues in the TIMSS Mathematics and Science Assessments

Peer reviewed

Direct link

Wang, Jianjun – School Science and Mathematics, 2011

As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…

Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking

Diagnostic Classification Modeling: Opportunity for Identity

Peer reviewed

Direct link

Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009

As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…

Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests

Previous Page | Next Page »

Pages: 1 | 2

van der Linden, Wim J.	4
Lee, Young-Sun	2
Park, Yoon Soo	2
Barrett, Frank	1
Batchelder, William H.	1
Bhaskar, R.	1
Choi, Kyong Mi	1
Dillard, Jesse F.	1
Dirlik, Ezgi Mor	1
Eaton, Philip	1
Engelhard, George, Jr.	1
France, Stephen L.	1
Graham, James M.	1
Haladyna, Tom	1
Hancock, Gregory R.	1
Jiao, Hong	1
Johnson, Keith	1
Maydeu-Olivares, Alberto	1
Ramsay, James O.	1
Rao, Vasanthi	1
Robitzsch, Alexander	1
Roid, Gale	1
Sotaridona, Leonardo	1
Soysal, Sumeyra	1
Wang, Jianjun	1
More ▼