Schumacker, Randall – Measurement: Interdisciplinary Research and Perspectives, 2019
The R software provides packages and functions that support data analysis in classical true score, generalizability theory, item response theory, and Rasch measurement theories. A brief list of notable articles in each measurement theory and the first measurement journals is followed by a list of R psychometric software packages. Each psychometric…
Descriptors: Psychometrics, Computer Software, Measurement, Item Response Theory
Peterson, Christina Hamme; Gischlar, Karen L.; Peterson, N. Andrew – Journal for Specialists in Group Work, 2017
Measures that accurately capture the phenomenon are critical to research and practice in group work. The vast majority of group-related measures were developed using the reflective measurement model rooted in classical test theory (CTT). Depending on the construct definition and the measure's purpose, the reflective model may not always be the…
Descriptors: Item Response Theory, Group Activities, Test Theory, Test Items
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in an educational system performs a number of functions; the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Ravand, Hamdollah – Practical Assessment, Research & Evaluation, 2015
Cognitive diagnostic models (CDMs) have been around for more than a decade, but their application is far from widespread, mainly for two reasons: (1) CDMs are novel compared to traditional IRT models, so many researchers lack familiarity with them and their properties, and (2) software programs for CDMs have been expensive and not…
Descriptors: Test Theory, Models, Computer Software, Open Source Technology
Culpepper, Steven Andrew – Applied Psychological Measurement, 2013
A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement
Mitchell, Alison M.; Truckenmiller, Adrea; Petscher, Yaacov – Communique, 2015
As part of the Race to the Top initiative, the United States Department of Education made nearly 1 billion dollars available in State Educational Technology grants with the goal of ramping up school technology. One result of this effort is that states, districts, and schools across the country are using computerized assessments to measure their…
Descriptors: Computer Assisted Testing, Educational Technology, Testing, Efficiency
Improving Comprehension Assessment for Middle and High School Students: Challenges and Opportunities
Sabatini, John; Petscher, Yaacov; O'Reilly, Tenaha; Truckenmiller, Adrea – Grantee Submission, 2015
For decades, standardized reading comprehension tests have consisted of a series of passages and associated multiple-choice questions. Although widely used in and out of the classroom, there continues to be considerable disagreement regarding how or whether such tests have net value in the service of advancing educational progress in reading. This…
Descriptors: Middle School Students, High School Students, Reading Comprehension, Reading Tests
Maydeu-Olivares, Alberto – Measurement: Interdisciplinary Research and Perspectives, 2013
In this rejoinder, Maydeu-Olivares states that, in item response theory (IRT) measurement applications, the application of goodness-of-fit (GOF) methods informs researchers of the discrepancy between the model and the data being fitted (the room for improvement). By routinely reporting the GOF of IRT models, together with the substantive results…
Descriptors: Goodness of Fit, Models, Evaluation Methods, Item Response Theory
Baird, Jo-Anne; Black, Paul – Research Papers in Education, 2013
Much has already been written on the controversies surrounding the use of different test theories in educational assessment. Other authors have noted the prevalence of classical test theory over item response theory in practice. This Special Issue draws together articles based upon work conducted on the Reliability Programme for England's…
Descriptors: Test Theory, Foreign Countries, Test Reliability, Item Response Theory
Royal, Kenneth D. – Association for Institutional Research (NJ1), 2010
Quality measurement is essential in every form of research, including institutional research and assessment. This paper addresses the erroneous assumptions institutional researchers often make with regard to survey research and provides an alternative method to producing more valid and reliable measures. Rasch measurement models are discussed and…
Descriptors: Institutional Research, Higher Education, Surveys, Measurement
Rosenfeld, Barry; Pessin, Hayley; Lewis, Charles; Abbey, Jennifer; Olden, Megan; Sachs, Emily; Amakawa, Lia; Kolva, Elissa; Brescia, Robert; Breitbart, William – Psychological Assessment, 2011
Hopelessness has become an increasingly important construct in palliative care research, yet concerns exist regarding the utility of existing measures when applied to patients with a terminal illness. This article describes a series of studies focused on the exploration, development, and analysis of a measure of hopelessness specifically intended…
Descriptors: Expertise, Psychological Patterns, Terminal Illness, Cancer
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Salmani-Nodoushan, Mohammad Ali – Journal on Educational Psychology, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure, and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for any…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
Salmani-Nodoushan, Mohammad Ali – Online Submission, 2009
A good test is one that has at least three qualities: reliability, or the precision with which a test measures what it is supposed to measure; validity, i.e., if the test really measures what it is supposed to measure; and practicality, or if the test, no matter how sound theoretically, is practicable in reality. These are the sine qua non for…
Descriptors: Generalizability Theory, Testing, Language Tests, Item Response Theory
DeMars, Christine E. – Educational and Psychological Measurement, 2008
The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…
Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement