Showing 5,281 to 5,295 of 9,547 results
Peer reviewed
Wu, Amery D.; Ercikan, Kadriye – International Journal of Testing, 2006
Identifying the sources of differential item functioning (DIF) in international assessments is very challenging, because such sources are often nebulous and intertwined. Even though researchers frequently focus on test translation and content area, few actually go beyond these factors to investigate other cultural sources of DIF. This article…
Descriptors: Test Bias, Cultural Influences, Case Studies, Foreign Countries
Peer reviewed
Ferrando, Pere J.; Condon, Lorena – Structural Equation Modeling: A Multidisciplinary Journal, 2006
This article proposes procedures for assessing acquiescence in a balanced set of binary personality items. These procedures are based on the bidimensional item-factor analysis model, which is an alternative parameterization of the bidimensional 2-parameter normal-ogive item response theory model. First the rationale and general approach are…
Descriptors: Factor Analysis, Item Response Theory, Personality Measures, Models
Peer reviewed
Xing, Dehui; Hambleton, Ronald K. – Educational and Psychological Measurement, 2004
Computer-based testing by credentialing agencies has become common; however, selecting a test design is difficult because several good ones are available: parallel forms, computer adaptive (CAT), and multistage (MST). In this study, three computer-based test designs under some common examination conditions were investigated. Item bank size and…
Descriptors: Test Construction, Psychometrics, Item Banks, Computer Assisted Testing
Peer reviewed
Li, Yanmei; Cohen, Allan S.; Ibarra, Robert A. – International Journal of Testing, 2004
Most research on differential item functioning (DIF) focuses on methods for detection rather than on understanding why DIF might occur. This study was designed to investigate whether two alternative approaches to parsing items based on structural characteristics related to particular cognitive strategies could be used to help explain gender DIF.…
Descriptors: Test Items, Cognitive Structures, Gender Differences, Mathematics Tests
Peer reviewed
Tate, Richard L. – Applied Measurement in Education, 2004
The valid provision of subscores from an item response theory-based test implies a multidimensional test structure. Assuming, in the construction of a new test, that the test features required for a valid and reliable total test score have been specified already, this article describes the resulting subscore performance and the resulting…
Descriptors: Scores, Test Items, Item Response Theory, Test Construction
Peer reviewed
Forster, Patricia A.; Mueller, Ute; Haimes, David; Malone, John – International Journal of Mathematical Education in Science and Technology, 2003
Inquires into assessment items classified as "extended pieces of work" in applicable mathematics in Western Australia. Identifies opportunities for graphics calculator use in extended pieces implemented in schools. Concludes that availability of the technology has widened the scope of approaches used in extended pieces of work in…
Descriptors: Educational Technology, Foreign Countries, Graphing Calculators, Mathematics Instruction
Peer reviewed
Pommerich, Mary – Journal of Educational Measurement, 2006
Domain scores have been proposed as a user-friendly way of providing instructional feedback about examinees' skills. Domain performance typically cannot be measured directly; instead, scores must be estimated using available information. Simulation studies suggest that IRT-based methods yield accurate group domain score estimates. Because…
Descriptors: Test Validity, Scores, Simulation, Evaluation Methods
Peer reviewed
Kahraman, Nilufer; Kamata, Akihito – Applied Psychological Measurement, 2004
In this study, the precision of subscale score estimates was evaluated when out-of-scale information was incorporated. Procedures that incorporated out-of-scale information and only information within a subscale were compared through a series of simulations. It was revealed that more information (i.e., more precision) was always provided for…
Descriptors: Scores, Computation, Evaluation Methods, Simulation
Peer reviewed
van der Linden, Wim J.; Veldkamp, Bernard P. – Journal of Educational and Behavioral Statistics, 2004
Item-exposure control in computerized adaptive testing is implemented by imposing item-ineligibility constraints on the assembly process of the shadow tests. The method resembles Sympson and Hetter's (1985) method of item-exposure control in that the decisions to impose the constraints are probabilistic. The method does not, however, require…
Descriptors: Probability, Law Schools, Admission (School), Adaptive Testing
Peer reviewed
Williams, Natasha J.; Beretvas, S. Natasha – Applied Psychological Measurement, 2006
The relationship between the hierarchical generalized linear model (HGLM) and item response theory (IRT) models has been demonstrated for dichotomous items. The current study demonstrated the use of the HGLM for polytomous items (termed PHGLM) for identification of differential item functioning (DIF). First, the algebraic equivalence between…
Descriptors: Identification, Rating Scales, Test Items, Item Response Theory
Peer reviewed
Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M. – Journal of Educational and Behavioral Statistics, 2003
Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…
Descriptors: Test Items, Markov Processes, Educational Testing, Probability
Peer reviewed
van der Linden, Wim J.; Sotaridona, Leonardo – Journal of Educational and Behavioral Statistics, 2006
A statistical test for detecting answer copying on multiple-choice items is presented. The test is based on the exact null distribution of the number of random matches between two test takers under the assumption that the response process follows a known response model. The null distribution can easily be generalized to the family of distributions…
Descriptors: Test Items, Multiple Choice Tests, Cheating, Responses
Peer reviewed
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis
Johnstone, Christopher; Liu, Kristi; Altman, Jason; Thurlow, Martha – National Center on Educational Outcomes, University of Minnesota, 2007
This document reports on research related to large-scale assessments for students with learning disabilities in the area of reading. As part of a process of making assessments more universally designed, the authors examined the role of "readable and comprehensible" test items (Thompson, Johnstone, & Thurlow, 2002). In this research, they used think…
Descriptors: Test Items, Readability, Learning Disabilities, Protocol Analysis
Peer reviewed
Wright, Anthony A. – Journal of the Experimental Analysis of Behavior, 2007
Rhesus monkeys were trained and tested in visual and auditory list-memory tasks with sequences of four travel pictures or four natural/environmental sounds followed by single test items. Acquisitions of the visual list-memory task are presented. Visual recency (last item) memory diminished with retention delay, and primacy (first item) memory…
Descriptors: Memory, Test Items, Familiarity, Inhibition