ERIC - Search Results

Showing 1 to 15 of 20 results

Psychometric Packages in R

Peer reviewed

Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019

The R software provides packages and functions that provide data analysis in classical true score, generalizability theory, item response theory, and Rasch measurement theories. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…

Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory

Item Construction Using Reflective, Formative, or Rasch Measurement Models: Implications for Group Work

Peer reviewed

Peterson, Christina Hamme; Gischlar, Karen L.; Peterson, N. Andrew – Journal for Specialists in Group Work, 2017

Measures that accurately capture the phenomenon are critical to research and practice in group work. The vast majority of group-related measures were developed using the reflective measurement model rooted in classical test theory (CTT). Depending on the construct definition and the measure's purpose, the reflective model may not always be the…

Descriptors: Item Response Theory, Group Activities, Test Theory, Test Items

Item Response Theory: An Introduction to Latent Trait Models to Test and Item Development

Peer reviewed

Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018

Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…

Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making

Cognitive Diagnostic Modeling Using R

Peer reviewed

Ravand, Hamdollah – Practical Assessment, Research & Evaluation, 2015

Cognitive diagnostic models (CDM) have been around for more than a decade but their application is far from widespread for mainly two reasons: (1) CDMs are novel, as compared to traditional IRT models. Consequently, many researchers lack familiarity with them and their properties, and (2) Software programs doing CDMs have been expensive and not…

Descriptors: Test Theory, Models, Computer Software, Open Source Technology

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Computer-Adaptive Assessments: Fundamentals and Considerations

Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015

As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…

Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency

Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities

Peer reviewed

Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015

For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…

Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests

Why Should We Assess the Goodness-of-Fit of IRT Models?

Peer reviewed

Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013

In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…

Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory

Test Theories, Educational Priorities and Reliability of Public Examinations in England

Peer reviewed

Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013

Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…

Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory

Making Meaningful Measurement in Survey Research: A Demonstration of the Utility of the Rasch Model. IR Applications. Volume 28

Royal, Kenneth D. – Association for Institutional Research (NJ1), 2010

Quality measurement is essential in every form of research, including institutional research and assessment. This paper addresses the erroneous assumptions institutional researchers often make with regard to survey research and provides an alternative method to producing more valid and reliable measures. Rasch measurement models are discussed and…

Descriptors: Institutional Research, Higher Education, Surveys, Measurement

Assessing Hopelessness in Terminally Ill Cancer Patients: Development of the Hopelessness Assessment in Illness Questionnaire

Peer reviewed

Rosenfeld, Barry; Pessin, Hayley; Lewis, Charles; Abbey, Jennifer; Olden, Megan; Sachs, Emily; Amakawa, Lia; Kolva, Elissa; Brescia, Robert; Breitbart, William – Psychological Assessment, 2011

Hopelessness has become an increasingly important construct in palliative care research, yet concerns exist regarding the utility of existing measures when applied to patients with a terminal illness. This article describes a series of studies focused on the exploration, development, and analysis of a measure of hopelessness specifically intended…

Descriptors: Expertise, Psychological Patterns, Terminal Illness, Cancer

On Bias in Linear Observed-Score Equating

Peer reviewed

van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010

The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…

Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends

Measurement Theory in Language Testing: Past Traditions and Current Trends

Peer reviewed

Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Measurement Theory in Language Testing: Past Traditions and Current Trends

Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009

A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…

Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory

Polytomous Differential Item Functioning and Violations of Ordering of the Expected Latent Trait by the Raw Score

Peer reviewed

DeMars, Christine E. – Educational and Psychological Measurement, 2008

The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…

Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement
