NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 3,466 to 3,480 of 9,552 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Welsh, Megan E.; Eastwood, Melissa; D'Agostino, Jerome V. – Applied Measurement in Education, 2014
Teacher and school accountability systems based on high-stakes tests are ubiquitous throughout the United States and appear to be growing as a catalyst for reform. As a result, educators have increased the proportion of instructional time devoted to test preparation. Although guidelines for what constitutes appropriate and inappropriate test…
Descriptors: High Stakes Tests, Instruction, Test Preparation, Grade 3
Peer reviewed Peer reviewed
Direct linkDirect link
Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014
Typically a longitudinal growth modeling based on item response theory (IRT) requires repeated measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses and tests at multiple time points are constructed with unique…
Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Stoffel, Heather; Raymond, Mark R.; Bucak, S. Deniz; Haist, Steven A. – Practical Assessment, Research & Evaluation, 2014
Previous research on the impact of text and formatting changes on test-item performance has produced mixed results. This matter is important because it is generally acknowledged that "any" change to an item requires that it be recalibrated. The present study investigated the effects of seven classes of stylistic changes on item…
Descriptors: Test Construction, Test Items, Standardized Tests, Physicians
National Assessment Governing Board, 2014
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…
Descriptors: National Competency Tests, Mathematics Achievement, Mathematics Skills, Grade 4
Peer reviewed Peer reviewed
Direct linkDirect link
Gan, Zhengdong – Changing English: Studies in Culture and Education, 2012
Leung and Lewkowicz remind us that the debate over the past two decades that is most relevant to ELT (English languge teaching) pedagogy and curriculum concerns test-task authenticity. This paper first reviews how the authenticity debate in the literature of second language acquisition, pedagogy and testing has evolved. Drawing on a body of…
Descriptors: Teaching Methods, English (Second Language), Second Language Learning, Second Language Instruction
Rudner, Lawrence M. – Graduate Management Admission Council, 2012
The Graduate Management Admission Test (GMAT) is administered in English and is designed for programs that teach in English. But the required English skill level is much less than what students will need in the classroom. The exam requires just enough English to allow us to adequately and comprehensively assess Verbal reasoning, Quantitative…
Descriptors: College Entrance Examinations, Graduate Study, Business Administration Education, Test Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Schaffhauser, Dian – T.H.E. Journal, 2012
Tony Alpert, chief operating officer for the Smarter Balanced Assessment Consortium (SBAC), ponders whether to allow tablet computers--and particularly iPads--to be used for summative testing online. As Alpert points out, not only would student cheating compromise the validity of the individual student's test event, "worse yet, it could expose…
Descriptors: Cheating, Test Validity, Test Construction, Consortia
Peer reviewed Peer reviewed
Direct linkDirect link
Jack, Brady Michael; Liu, Chia-Ju; Chiu, Houn-Lin; Tsai, Chun-Yen – International Journal of Science and Mathematics Education, 2012
The present study investigated whether gender differences were present on the confidence judgments made by 8th grade Taiwanese students on the accuracy of their responses to acid-base test items. A total of 147 (76 male, 71 female) students provided item-specific confidence judgments during a test of their knowledge of acids and bases. Using the…
Descriptors: Test Items, Females, Self Esteem, Grade 8
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Ping; Xin, Tao; Wang, Chun; Chang, Hua-Hua – Psychometrika, 2012
Item replenishing is essential for item bank maintenance in cognitive diagnostic computerized adaptive testing (CD-CAT). In regular CAT, online calibration is commonly used to calibrate the new items continuously. However, until now no reference has publicly become available about online calibration for CD-CAT. Thus, this study investigates the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Diagnostic Tests, Cognitive Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012
A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…
Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation
Peer reviewed Peer reviewed
Direct linkDirect link
Boyd, Jeremy K.; Goldberg, Adele E. – Journal of Child Language, 2012
The present study exposed five-year-olds (M=5 ; 2), seven-year-olds (M=7 ; 6) and adults (M=22 ; 4) to instances of a novel phrasal construction, then used a forced choice comprehension task to evaluate their learning of the construction. The abstractness of participants' acquired representations of the novel construction was evaluated by varying…
Descriptors: Verbs, Generalization, Linguistic Input, Young Children
Peer reviewed Peer reviewed
Direct linkDirect link
Wills, Andy J.; Pothos, Emmanuel M. – Psychological Bulletin, 2012
Categorization is one of the fundamental building blocks of cognition, and the study of categorization is notable for the extent to which formal modeling has been a central and influential component of research. However, the field has seen a proliferation of noncomplementary models with little consensus on the relative adequacy of these accounts.…
Descriptors: Classification, Computation, Test Items, Generalizability Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Finch, W. Holmes – Applied Psychological Measurement, 2012
Increasingly, researchers interested in identifying potentially biased test items are encouraged to use a confirmatory, rather than exploratory, approach. One such method for confirmatory testing is rooted in differential bundle functioning (DBF), where hypotheses regarding potential differential item functioning (DIF) for sets of items (bundles)…
Descriptors: Test Bias, Test Items, Statistical Analysis, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Kruyen, Peter M.; Emons, Wilco H. M.; Sijtsma, Klaas – International Journal of Testing, 2012
Personnel selection shows an enduring need for short stand-alone tests consisting of, say, 5 to 15 items. Despite their efficiency, short tests are more vulnerable to measurement error than longer test versions. Consequently, the question arises to what extent reducing test length deteriorates decision quality due to increased impact of…
Descriptors: Measurement, Personnel Selection, Decision Making, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Peterson, Christina Hamme – Journal for Specialists in Group Work, 2012
Counseling work is increasingly conducted in team format. The methods counseling teams use to manage the emotional component of their group life, or their group emotional intelligence, have been proposed as significantly contributing to group member trust, cooperation, and ultimate performance. Item development, exploratory factor analysis, and…
Descriptors: Group Counseling, Emotional Intelligence, Measures (Individuals), Validity
Pages: 1  |  ...  |  228  |  229  |  230  |  231  |  232  |  233  |  234  |  235  |  236  |  ...  |  637