NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 7 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Hansen, Mark; Cai, Li; Monroe, Scott; Li, Zhen – Grantee Submission, 2016
Despite the growing popularity of diagnostic classification models (e.g., Rupp, Templin, & Henson, 2010) in educational and psychological measurement, methods for testing their absolute goodness-of-fit to real data remain relatively underdeveloped. For tests of reasonable length and for realistic sample size, full-information test statistics…
Descriptors: Goodness of Fit, Item Response Theory, Classification, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sabatini, John; O'Reilly, Tenaha; Deane, Paul – ETS Research Report Series, 2013
This report describes the foundation and rationale for a framework designed to measure reading literacy. The aim of the effort is to build an assessment system that reflects current theoretical conceptions of reading and is developmentally sensitive across a prekindergarten to 12th grade student range. The assessment framework is intended to…
Descriptors: Reading Tests, Literacy, Models, Testing Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Assouline, Susan G.; Lupkowski-Shoplik, Ann – Journal of Psychoeducational Assessment, 2012
The Talent Search model, founded at Johns Hopkins University by Dr. Julian C. Stanley, is fundamentally an above-level testing program. This simplistic description belies the enduring impact that the Talent Search model has had on the lives of hundreds of thousands of gifted students as well as their parents and teachers. In this article, we…
Descriptors: Testing Programs, Academically Gifted, Elementary Secondary Education, Talent
Peer reviewed Peer reviewed
Direct linkDirect link
Li, Ying; Jiao, Hong; Lissitz, Robert W. – Journal of Applied Testing Technology, 2012
This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…
Descriptors: Achievement Tests, Science Tests, Item Response Theory, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008
In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…
Descriptors: Testing Programs, State Programs, Standard Setting, Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Crundwell, R. Marc – Canadian Journal of Educational Administration and Policy, 2005
Recent focus on student achievement and the effectiveness of schools, school boards, and teachers has lead to increased demands for accountability in education. Large scale assessments are now used in most provinces in Canada to examine the degree to which educational standards are being reached and explore issues of accountability. Alternative…
Descriptors: Models, School Effectiveness, Measures (Individuals), Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Luecht, Richard M. – Journal of Applied Testing Technology, 2005
Computer-based testing (CBT) is typically implemented using one of three general test delivery models: (1) multiple fixed testing (MFT); (2) computer-adaptive testing (CAT); or (3) multistage testing (MSTs). This article reviews some of the real cost drivers associated with CBT implementation--focusing on item production costs, the costs…
Descriptors: Adaptive Testing, Computer Assisted Testing, Quality Control, Costs