NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 7 results Save | Export
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014
It is a well-known problem in testing the fit of models to multinomial data that the full underlying contingency table will inevitably be sparse for tests of reasonable length and for realistic sample sizes. Under such conditions, full-information test statistics such as Pearson's X[superscript 2] and the likelihood ratio statistic G[superscript…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016
Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Keller, Lisa A.; Keller, Robert R. – Educational and Psychological Measurement, 2011
This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…
Descriptors: Item Response Theory, Scaling, Sustainability, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
Guion, Robert M.; Ironson, Gail H. – 1979
Challenges to classical psychometric theory are examined in the context of a broader range of fundamental, derived, and intuitive measurements in psychology; the challenges include content-referenced testing, latent trait theory, and generalizability theory. A taxonomy of psychological measurement is developed, based on: (1) purposes of…
Descriptors: Classification, Latent Trait Theory, Measurement Objectives, Program Evaluation
Terrasi, Salvatore – 1989
This study examined the consistency of classification for a sample of special needs students on the state-mandated Massachusetts Basic Skills Inventory (BSI). The study sample consisted of 172 special education students (114 males and 58 females) from 15 elementary schools in a large urban school district in Massachusetts, who took the…
Descriptors: Basic Skills, Classification, Comparative Testing, Educational Diagnosis
Follettie, Joseph F. – 1976
General features of local and national programs for assessing achievements referencing the common instruction are discussed within a single mastery achievement testing framework. The envisioned programs differ only in informative detail. Most such differences are viewed as amenable to formalization and the basis for distinguishing between local…
Descriptors: Academic Achievement, Accountability, Achievement Tests, Classification