Ouyang, Jing; Xu, Gongjun – Grantee Submission, 2022
Latent class models with covariates are widely used for psychological, social, and educational research. Yet the fundamental identifiability issue of these models has not been fully addressed. Among the previous research on the identifiability of latent class models with covariates, Huang and Bandeen-Roche (Psychometrika 69:5-32, 2004) studied the…
Descriptors: Item Response Theory, Models, Identification, Psychological Studies
Joo, Seang-Hwane; Khorramdel, Lale; Yamamoto, Kentaro; Shin, Hyo Jeong; Robin, Frederic – Educational Measurement: Issues and Practice, 2021
In Programme for International Student Assessment (PISA), item response theory (IRT) scaling is used to examine the psychometric properties of items and scales and to provide comparable test scores across participating countries and over time. To balance the comparability of IRT item parameter estimations across countries with the best possible…
Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Dai, Shenghai; Svetina, Dubravka; Wang, Xiaolin – Journal of Educational and Behavioral Statistics, 2017
There is an increasing interest in reporting test subscores for diagnostic purposes. In this article, we review nine popular R packages (subscore, mirt, TAM, sirt, CDM, NPCD, lavaan, sem, and OpenMX) that are capable of implementing subscore-reporting methods within one or more frameworks including classical test theory, multidimensional item…
Descriptors: Diagnostic Tests, Scores, Computer Software, Item Response Theory
Jerrim, John; Parker, Philip; Choi, Alvaro; Chmielewski, Anna Katyn; Sälzer, Christine; Shure, Nikki – Educational Measurement: Issues and Practice, 2018
The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' knowledge and skills. New results are released every 3 years and have a substantial impact upon education policy. Yet, despite its influence, the methodology underpinning PISA has received significant criticism. Much of this criticism has…
Descriptors: Educational Assessment, Comparative Education, Achievement Tests, Foreign Countries
Ho, Andrew Dean – Journal of Educational and Behavioral Statistics, 2016
In this article, Andrew Dean Ho presents a response to David Thissen's essay, "Bad Questions: An Essay Involving Item Response Theory" (2016), calling it an excellent contribution to the genre of commentaries on the field, one that joins the likes of the piece by Thissen's frequent collaborator, Howard Wainer (2010), who published "14…
Descriptors: Item Response Theory, Statistics, Psychometrics, Goodness of Fit
Raspa, Melissa; Bann, Carla M.; Gwaltney, Angela; Benke, Timothy A.; Fu, Cary; Glaze, Daniel G.; Haas, Richard; Heydemann, Peter; Jones, Mary; Kaufmann, Walter E.; Lieberman, David; Marsh, Eric; Peters, Sarika; Ryther, Robin; Standridge, Shannon; Skinner, Steven A.; Percy, Alan K.; Neul, Jeffrey L. – American Journal on Intellectual and Developmental Disabilities, 2020
Rett syndrome (RTT) is a neurodevelopmental disorder that primarily affects females. Recent work indicates the potential for disease modifying therapies. However, there remains a need to develop outcome measures for use in clinical trials. Using data from a natural history study (n = 1,075), we examined the factor structure, internal consistency,…
Descriptors: Genetic Disorders, Psychometrics, Psychomotor Skills, Physical Disabilities
Cantley, Ian – Educational Philosophy and Theory, 2019
Mathematics achievement in different education systems around the world is assessed periodically in the Programme for International Student Assessment (PISA). PISA is deemed to yield robust international comparisons of mathematical attainment that enable individual countries and regions to monitor the performance of their education systems…
Descriptors: Mathematics Achievement, Achievement Tests, Foreign Countries, International Assessment
Psychometric and Evidentiary Advances, Opportunities, and Challenges for Simulation-Based Assessment
Levy, Roy – Educational Assessment, 2013
This article characterizes the advances, opportunities, and challenges for psychometrics of simulation-based assessments through a lens that views assessment as evidentiary reasoning. Simulation-based tasks offer the prospect for student experiences that differ from traditional assessment. Such tasks may be used to support evidentiary arguments…
Descriptors: Simulation, Student Evaluation, Psychometrics, Evidence
Wolf, Mikyung Kim; Guzman-Orth, Danielle; Lopez, Alexis; Castellano, Katherine; Himelfarb, Igor; Tsutagawa, Fred S. – Educational Assessment, 2016
This article investigates ways to improve the assessment of English learner students' English language proficiency given the current movement of creating next-generation English language proficiency assessments in the Common Core era. In particular, this article discusses the integration of scaffolding strategies, which are prevalently utilized as…
Descriptors: English Language Learners, Scaffolding (Teaching Technique), Language Tests, Language Proficiency
Boyd, Aimee M.; Dodd, Barbara; Fitzpatrick, Steven – Applied Measurement in Education, 2013
This study compared several exposure control procedures for CAT systems based on the three-parameter logistic testlet response theory model (Wang, Bradlow, & Wainer, 2002) and Masters' (1982) partial credit model when applied to a pool consisting entirely of testlets. The exposure control procedures studied were the modified within 0.10 logits…
Descriptors: Computer Assisted Testing, Item Response Theory, Test Construction, Models
Liu, Ying; Verkuilen, Jay – Applied Psychological Measurement, 2013
The Presence-Severity (P-S) format refers to a compound item structure in which a question is first asked to check the presence of the particular event in question. If the respondent provides an affirmative answer, a follow-up is administered, often about the frequency, density, severity, or impact of the event. Despite the popularity of the P-S…
Descriptors: Item Response Theory, Measures (Individuals), Psychometrics, Cancer
Arendasy, Martin E.; Sommer, Markus – Learning and Individual Differences, 2012
The use of new test administration technologies such as computerized adaptive testing in high-stakes educational and occupational assessments demands large item pools. Classic item construction processes and previous approaches to automatic item generation faced the problems of a considerable loss of items after the item calibration phase. In this…
Descriptors: Item Banks, Test Items, Adaptive Testing, Psychometrics
Lee, Young-Sun; Krishnan, Anita; Park, Yoon Soo – Measurement and Evaluation in Counseling and Development, 2012
The purpose of this study was to investigate psychometric properties of the Children's Depression Inventory within a nonclinical and longitudinal sample (8th and 12th grades). Using the Rasch rating scale, most items represented one dimension. There was adequate separation among items and no overlap between ranges of item difficulties with latent…
Descriptors: Rating Scales, Psychometrics, Depression (Psychology), Item Response Theory
Partchev, Ivailo; De Boeck, Paul; Steyer, Rolf – Assessment, 2013
An old issue in psychological assessment is to what extent power and speed each are measured by a given intelligence test. Starting from accuracy and response time data, an approach based on posterior time limits (cut-offs of recorded response time) leads to three kinds of recoded data: time data (whether or not the response precedes the cut-off),…
Descriptors: Psychological Testing, Intelligence Tests, Time, Item Response Theory