NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 4,696 to 4,710 of 9,533 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Belov, Dmitry I.; Armstrong, Ronald D.; Weissman, Alexander – Applied Psychological Measurement, 2008
This article presents a new algorithm for computerized adaptive testing (CAT) when content constraints are present. The algorithm is based on shadow CAT methodology to meet content constraints but applies Monte Carlo methods and provides the following advantages over shadow CAT: (a) lower maximum item exposure rates, (b) higher utilization of the…
Descriptors: Test Items, Monte Carlo Methods, Law Schools, Adaptive Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Hattie, John A. C.; Brown, Gavin T. L. – Journal of Educational Technology Systems, 2008
National assessment systems can be enhanced with effective school-based assessment (SBA) that allows teachers to focus on improvement decisions. Modern computer-assisted technology systems are often used to deploy SBA systems. Since 2000, New Zealand has researched, developed, and deployed a national, computer-assisted SBA system. Eight major…
Descriptors: Computers, Information Technology, Foreign Countries, Computer Uses in Education
Peer reviewed Peer reviewed
Direct linkDirect link
Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008
Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…
Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Mahoney, Kate – International Journal of Testing, 2008
Education policy in many countries has undergone changes regarding the testing of English Language Learners (ELLs), who by definition are not yet proficient in the language of the test. As policies mandate the inclusion of ELLs in large-scale testing, many question the validity of achievement test scores because the degree to which the test score…
Descriptors: Test Items, Linguistics, Testing, Second Language Learning
Bietau, Lisa Artman – ProQuest LLC, 2011
A foundational mission of our public schools is dedicated to preserving a democratic republic dependent on a literate and actively engaged citizenry. Civic literacy is essential to supporting the rights and responsibilities of all citizens in a democratic society. Civic knowledge is the foundation of our citizens' civic literacy. National…
Descriptors: National Standards, Test Items, Feedback (Response), Citizenship
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007
This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification PSE with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length
Peer reviewed Peer reviewed
Direct linkDirect link
Hendrickson, Amy – Educational Measurement: Issues and Practice, 2007
Multistage tests are those in which sets of items are administered adaptively and are scored as a unit. These tests have all of the advantages of adaptive testing, with more efficient and precise measurement across the proficiency scale as well as time savings, without many of the disadvantages of an item-level adaptive test. As a seemingly…
Descriptors: Adaptive Testing, Test Construction, Measurement Techniques, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Seock-Ho; Cohen, Allan S.; Alagoz, Cigdem; Kim, Sukwoo – Journal of Educational Measurement, 2007
Data from a large-scale performance assessment (N = 105,731) were analyzed with five differential item functioning (DIF) detection methods for polytomous items to examine the congruence among the DIF detection methods. Two different versions of the item response theory (IRT) model-based likelihood ratio test, the logistic regression likelihood…
Descriptors: Performance Based Assessment, Performance Tests, Item Response Theory, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Passos, Valeria Lima; Berger, Martijn P. F.; Tan, Frans E. – Applied Psychological Measurement, 2007
The early stage of computerized adaptive testing (CAT) refers to the phase of the trait estimation during the administration of only a few items. This phase can be characterized by bias and instability of estimation. In this study, an item selection criterion is introduced in an attempt to lessen this instability: the D-optimality criterion. A…
Descriptors: Test Construction, Test Items, Item Response Theory, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Marshall, Robert C.; Wright, Heather Harris – American Journal of Speech-Language Pathology, 2007
Purpose: The Kentucky Aphasia Test (KAT) is an objective measure of language functioning for persons with aphasia. This article describes materials, administration, and scoring of the KAT; presents the rationale for development of test items; reports information from a pilot study; and discusses the role of the KAT in aphasia assessment. Method:…
Descriptors: Aphasia, Test Format, Language Tests, Expressive Language
Peer reviewed Peer reviewed
Direct linkDirect link
Cheng, Ying; Chang, Hua-Hua; Yi, Qing – Applied Psychological Measurement, 2007
Content balancing is an important issue in the design and implementation of computerized adaptive testing (CAT). Content-balancing techniques that have been applied in fixed content balancing, where the number of items from each content area is fixed, include constrained CAT (CCAT), the modified multinomial model (MMM), modified constrained CAT…
Descriptors: Adaptive Testing, Item Analysis, Computer Assisted Testing, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Braden, Jeffery P.; Iribarren, Jacqueline A. – Journal of Psychoeducational Assessment, 2007
In this article, the authors review the Wechsler Intelligence Scale for Children-Fourth Edition Spanish (WISC-IV Spanish), a Spanish translation and adaptation of the WISC-IV. The test was developed to measure the intellectual ability of Spanish-speaking children in the United States ages 6 years, 0 months, through 16 years, 11 months. These…
Descriptors: Intelligence Tests, Spanish, Translation, Children
Peer reviewed Peer reviewed
Direct linkDirect link
Wainer, Howard; Robinson, Daniel H. – Journal of Educational and Behavioral Statistics, 2007
Fumiko Samejima is best known for her pioneering work in polytomous response item response theory (IRT), yielding the eponymous model that has been used broadly for more than 30 years. In this interview, Samejima, on the verge of retiring from her faculty position at the University of Tennessee, discusses her life and career. She also describes…
Descriptors: Foreign Countries, Psychometrics, Item Response Theory, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Sireci, Stephen G. – Educational Researcher, 2007
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Descriptors: Test Content, Test Validity, Guidelines, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas – Multivariate Behavioral Research, 2007
The performance of five simple multiple imputation methods for dealing with missing data were compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmark, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…
Descriptors: Evaluation Methods, Psychometrics, Item Response Theory, Scores
Pages: 1  |  ...  |  310  |  311  |  312  |  313  |  314  |  315  |  316  |  317  |  318  |  ...  |  636