NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,771 to 1,785 of 3,296 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Puhan, Gautam; Moses, Tim; Grant, Mary; McHale, Fred – ETS Research Report Series, 2008
A single group (SG) equating design with nearly equivalent test forms (SiGNET) design was developed by Grant (2006) to equate small volume tests. The basis of this design is that examinees take two largely overlapping test forms within a single administration. The scored items for the operational form are divided into mini-tests called testlets.…
Descriptors: Data Collection, Equated Scores, Item Sampling, Sample Size
Peer reviewed Peer reviewed
Direct linkDirect link
Nordmoe, Eric D. – Teaching Statistics: An International Journal for Teachers, 2008
This article reports on a delicious finding from a recent study claiming a causal link between dark chocolate consumption and blood pressure reductions. In the article, I provide ideas for using this study to whet student appetites for a discussion of statistical ideas, including experimental design, measurement error and inference methods.
Descriptors: Causal Models, Health Behavior, Research Design, Hypertension
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Dekle, Dawn J.; Leung, Denis H. Y.; Zhu, Min – Psychological Methods, 2008
Across many areas of psychology, concordance is commonly used to measure the (intragroup) agreement in ranking a number of items by a group of judges. Sometimes, however, the judges come from multiple groups, and in those situations, the interest is to measure the concordance between groups, under the assumption that there is some within-group…
Descriptors: Item Response Theory, Statistical Analysis, Psychological Studies, Evaluators
Peer reviewed Peer reviewed
Direct linkDirect link
Eid, Michael; Nussbeck, Fridtjof W.; Geiser, Christian; Cole, David A.; Gollwitzer, Mario; Lischetzke, Tanja – Psychological Methods, 2008
The question as to which structural equation model should be selected when multitrait-multimethod (MTMM) data are analyzed is of interest to many researchers. In the past, attempts to find a well-fitting model have often been data-driven and highly arbitrary. In the present article, the authors argue that the measurement design (type of methods…
Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Statistical Analysis, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Hertzog, Christopher; von Oertzen, Timo; Ghisletta, Paolo; Lindenberger, Ulman – Structural Equation Modeling: A Multidisciplinary Journal, 2008
We evaluated the statistical power of single-indicator latent growth curve models to detect individual differences in change (variances of latent slopes) as a function of sample size, number of longitudinal measurement occasions, and growth curve reliability. We recommend the 2 degree-of-freedom generalized test assessing loss of fit when both…
Descriptors: Sample Size, Error of Measurement, Individual Differences, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Solano-Flores, Guillermo; Li, Min – Educational Measurement: Issues and Practice, 2009
We addressed the challenge of scoring cognitive interviews in research involving multiple cultural groups. We interviewed 123 fourth- and fifth-grade students from three cultural groups to probe how they related a mathematics item to their personal lives. Item meaningfulness--the tendency of students to relate the content and/or context of an item…
Descriptors: Generalizability Theory, Scoring, Error of Measurement, Grade 5
Peer reviewed Peer reviewed
Direct linkDirect link
Ramos, Erica; Alfonso, Vincent C.; Schermerhorn, Susan M. – Psychology in the Schools, 2009
The interpretation of cognitive test scores often leads to decisions concerning the diagnosis, educational placement, and types of interventions used for children. Therefore, it is important that practitioners administer and score cognitive tests without error. This study assesses the frequency and types of examiner errors that occur during the…
Descriptors: Graduate Students, Cognitive Tests, Scoring, Cognitive Ability
Peer reviewed Peer reviewed
Direct linkDirect link
Mokkink, Lidwine B.; Knol, Dirk L.; Zekveld, Adriana A.; Goverts, S. Theo; Kramer, Sophia E. – Journal of Speech, Language, and Hearing Research, 2009
Purpose: Seven scales of the Communication Profile for the Hearing Impaired (CPHI; M. E. Demorest & S. A. Erdman, 1987) were translated into Dutch: Maladaptive Behavior, Verbal Strategies, and Nonverbal Strategies (within the area of Communication Strategies) and Self-Acceptance, Acceptance of Loss, Stress, and Withdrawal (within the area of…
Descriptors: Communication Strategies, Factor Structure, Adjustment (to Environment), Patients
Peer reviewed Peer reviewed
Direct linkDirect link
Heim, Bradley T. – Journal of Human Resources, 2009
This paper proposes a new method for estimating family labor supply in the presence of taxes. This method accounts for continuous hours choices, measurement error, unobserved heterogeneity in tastes for work, the nonlinear form of the tax code, and fixed costs of work in one comprehensive specification. Estimated on data from the 2001 PSID, the…
Descriptors: Labor Supply, Taxes, Computation, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Huitema, Bradley E.; McKean, Joseph W. – Educational and Psychological Measurement, 2007
Regression models used in the analysis of interrupted time-series designs assume statistically independent errors. Four methods of evaluating this assumption are the Durbin-Watson (D-W), Huitema-McKean (H-M), Box-Pierce (B-P), and Ljung-Box (L-B) tests. These tests were compared with respect to Type I error and power under a wide variety of error…
Descriptors: Regression (Statistics), Evaluation Methods, Error of Measurement, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Jamshidian, M.; Khatoonabadi, M. – International Journal of Mathematical Education in Science and Technology, 2007
Almost all introductory and intermediate level statistics textbooks include the topic of confidence interval for the population mean. Almost all these texts introduce the median as a robust measure of central tendency. Only a few of these books, however, cover inference on the population median and in particular confidence interval for the median.…
Descriptors: Intervals, Simulation, Computation, Error of Measurement
Wang, Huan – ProQuest LLC, 2010
Multiple uses of the same assessment may present challenges for both the design and use of an assessment. Little advice, however, has been given to assessment developers as to how to understand the phenomena of multiple assessment use and meet the challenges these present. Particularly problematic is the case in which an assessment is used for…
Descriptors: Test Use, Testing Programs, Program Effectiveness, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Bradford, George; Wyatt, Shelly – Internet and Higher Education, 2010
A study by Mullen and Tallent-Runnels (2006) found significance in the differences between online and traditional students' reports of instructors' academic support, instructors' demands, and students' satisfaction. They also recognized that the limitation to their study was their demographic data. In an original report funded by the Alfred P.…
Descriptors: Electronic Learning, Ethnicity, Student Attitudes, Satisfaction
Pages: 1  |  ...  |  115  |  116  |  117  |  118  |  119  |  120  |  121  |  122  |  123  |  ...  |  220