Showing 4,711 to 4,725 of 9,554 results
Peer reviewed
Jang, Eunice Eunhee – Language Testing, 2009
With recent statistical advances in cognitive diagnostic assessment (CDA), the CDA approach has been increasingly applied to non-diagnostic tests partly to meet accountability demands for student achievement. The study aimed to evaluate critically the validity of the CDA application to an existing non-diagnostic L2 reading comprehension test and…
Descriptors: Feedback (Response), Reading Comprehension, Test Items, Validity
Paek, Insu; Lee, Jihyun; Stankov, Lazar; Wilson, Mark – ETS Research Report Series, 2008
This study investigated the relationship between students' actual performance (accuracy) and their subjective judgments of accuracy (confidence) on selected English language proficiency tests. The unidimensional and multidimensional IRT Rasch approaches were used to model the discrepancy between confidence and accuracy at the item and test level…
Descriptors: Self Esteem, Accuracy, Item Response Theory, English
Peer reviewed
Ballou, Dale – National Center on Performance Incentives, 2008
As currently practiced, value-added assessment relies on a strong assumption about the scales used to measure student achievement, namely that these are interval scales, with equal-sized gains at all points on the scale representing the same increment of learning. Many of the metrics in which test results are expressed do not have this property…
Descriptors: Test Items, Intervals, Data Analysis, Item Response Theory
Ashvind Nand Singh – ProQuest LLC, 2008
Due to the relative inability of individuals with intellectual disabilities (ID) to provide an accurate and reliable self-report, assessment in this population is more difficult than with individuals in the general population. As such, assessment procedures must be adjusted to compensate for the relative lack of information that the individual can…
Descriptors: Test Items, Item Analysis, Test Construction, Behavior Rating Scales
Peer reviewed
Liu, Ou Lydia; Lee, Hee-Sun; Hofstetter, Carolyn; Linn, Marcia C. – Educational Assessment, 2008
In response to the demand for sound science assessments, this article presents the development of a latent construct called knowledge integration as an effective measure of science inquiry. Knowledge integration assessments ask students to link, distinguish, evaluate, and organize their ideas about complex scientific topics. The article focuses on…
Descriptors: Standardized Tests, Scoring Rubrics, Psychometrics, Concept Mapping
Peer reviewed
Lee, Young-Sun; Grossman, Jennifer; Krishnan, Anita – Educational and Psychological Measurement, 2008
This study examined the cultural relevance of adult attachment within a Korean sample (N = 390) using Rasch rating scale modeling. The psychometric properties of scores from the Korean version of the Revised Experiences in Close Relationships, comprised of two subscales of Anxiety (self) and Avoidance (other), were assessed. Results obtained from…
Descriptors: Cultural Relevance, Attachment Behavior, Rating Scales, Psychometrics
Peer reviewed
Belov, Dmitry I.; Armstrong, Ronald D.; Weissman, Alexander – Applied Psychological Measurement, 2008
This article presents a new algorithm for computerized adaptive testing (CAT) when content constraints are present. The algorithm is based on shadow CAT methodology to meet content constraints but applies Monte Carlo methods and provides the following advantages over shadow CAT: (a) lower maximum item exposure rates, (b) higher utilization of the…
Descriptors: Test Items, Monte Carlo Methods, Law Schools, Adaptive Testing
Peer reviewed
Hattie, John A. C.; Brown, Gavin T. L. – Journal of Educational Technology Systems, 2008
National assessment systems can be enhanced with effective school-based assessment (SBA) that allows teachers to focus on improvement decisions. Modern computer-assisted technology systems are often used to deploy SBA systems. Since 2000, New Zealand has researched, developed, and deployed a national, computer-assisted SBA system. Eight major…
Descriptors: Computers, Information Technology, Foreign Countries, Computer Uses in Education
Peer reviewed
Wells, Craig S.; Bolt, Daniel M. – Applied Measurement in Education, 2008
Tests of model misfit are often performed to validate the use of a particular model in item response theory. Douglas and Cohen (2001) introduced a general nonparametric approach for detecting misfit under the two-parameter logistic model. However, the statistical properties of their approach, and empirical comparisons to other methods, have not…
Descriptors: Test Length, Test Items, Monte Carlo Methods, Nonparametric Statistics
Peer reviewed
Mahoney, Kate – International Journal of Testing, 2008
Education policy in many countries has undergone changes regarding the testing of English Language Learners (ELLs), who by definition are not yet proficient in the language of the test. As policies mandate the inclusion of ELLs in large-scale testing, many question the validity of achievement test scores because the degree to which the test score…
Descriptors: Test Items, Linguistics, Testing, Second Language Learning
Bietau, Lisa Artman – ProQuest LLC, 2011
A foundational mission of our public schools is dedicated to preserving a democratic republic dependent on a literate and actively engaged citizenry. Civic literacy is essential to supporting the rights and responsibilities of all citizens in a democratic society. Civic knowledge is the foundation of our citizens' civic literacy. National…
Descriptors: National Standards, Test Items, Feedback (Response), Citizenship
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
O'Shea, Mary B. – ProQuest LLC, 2010
Although much is known about how students perform on standardized tests, little research exists concerning how students think and process while taking such tests. This mixed methods action research study was designed to investigate if a constructivist approach to test preparation could yield improved results for 37 English language arts freshmen…
Descriptors: Test Preparation, Test Items, Statistical Analysis, Grade 9
Nering, Michael L., Ed.; Ostini, Remo, Ed. – Routledge, Taylor & Francis Group, 2010
This comprehensive "Handbook" focuses on the most used polytomous item response theory (IRT) models. These models help us understand the interaction between examinees and test questions where the questions have various response categories. The book reviews all of the major models and includes discussions about how and where the models…
Descriptors: Guides, Item Response Theory, Test Items, Correlation
Peer reviewed
Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007
This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification PSE with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length