ERIC - Search Results

Showing all 7 results

Limited-Information Goodness-of-Fit Testing of Diagnostic Classification Item Response Theory Models. CRESST Report 840

Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

It is a well-known problem in testing the fit of models to multinomial data that the full underlying contingency table will inevitably be sparse for tests of reasonable length and for realistic sample sizes. Under such conditions, full-information test statistics such as Pearson's X² and the likelihood ratio statistic G[superscript…

Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics

Limited-Information Goodness-of-Fit Testing of Diagnostic Classification Item Response Models

Peer reviewed

Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016

Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…

Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics

The Long-Term Sustainability of Different Item Response Theory Scaling Methods

Peer reviewed

Keller, Lisa A.; Keller, Robert R. – Educational and Psychological Measurement, 2011

This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…

Descriptors: Item Response Theory, Scaling, Sustainability, Classification

The Potential Impact of Not Being Able to Create Parallel Tests on Expected Classification Accuracy

Peer reviewed

Wyse, Adam E. – Applied Psychological Measurement, 2011

In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…

Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics

Principles of Work Sample Testing. Volume I: A Non-Empirical Taxonomy of Test Uses; Volume II: Evaluation of Personnel Testing Programs; Volume III: Construction and Evaluation of Work Sample Tests; Volume IV: Generalizability.

Guion, Robert M.; Ironson, Gail H. – 1979

Challenges to classical psychometric theory are examined in the context of a broader range of fundamental, derived, and intuitive measurements in psychology; the challenges include content-referenced testing, latent trait theory, and generalizability theory. A taxonomy of psychological measurement is developed, based on: (1) purposes of…

Descriptors: Classification, Latent Trait Theory, Measurement Objectives, Program Evaluation

Examining the Reliability of a State-Mandated Basic Skills Test for a Sample of Special Needs Students.

Terrasi, Salvatore – 1989

This study examined the consistency of classification for a sample of special needs students on the state-mandated Massachusetts Basic Skills Inventory (BSI). The study sample consisted of 172 special education students (114 males and 58 females) from 15 elementary schools in a large urban school district in Massachusetts, who took the…

Descriptors: Basic Skills, Classification, Comparative Testing, Educational Diagnosis

GPO: Send Me The Primary Effects of Common Instruction! Professional Paper 34.

Follettie, Joseph F. – 1976

General features of local and national programs for assessing achievements referencing the common instruction are discussed within a single mastery achievement testing framework. The envisioned programs differ only in informative detail. Most such differences are viewed as amenable to formalization and the basis for distinguishing between local…

Descriptors: Academic Achievement, Accountability, Achievement Tests, Classification
