NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 1,126 to 1,140 of 3,316 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Woods, Carol M.; Cai, Li; Wang, Mian – Educational and Psychological Measurement, 2013
Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's X[superscript 2] Wald test for…
Descriptors: Test Bias, Item Response Theory, Computation, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Yang; Maydeu-Olivares, Alberto – Educational and Psychological Measurement, 2013
Local dependence (LD) for binary IRT models can be diagnosed using Chen and Thissen's bivariate X[superscript 2] statistic and the score test statistics proposed by Glas and Suarez-Falcon, and Liu and Thissen. Alternatively, LD can be assessed using general purpose statistics such as bivariate residuals or Maydeu-Olivares and Joe's M[subscript r]…
Descriptors: Item Response Theory, Statistical Analysis, Models, Goodness of Fit
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Rindskopf, David – Society for Research on Educational Effectiveness, 2013
Single case designs (SCDs) generally consist of a small number of short time series in two or more phases. The analysis of SCDs statistically fits in the framework of a multilevel model, or hierarchical model. The usual analysis does not take into account the uncertainty in the estimation of the random effects. This not only has an effect on the…
Descriptors: Research Design, Bayesian Statistics, Computation, Data
Peer reviewed Peer reviewed
Direct linkDirect link
Petscher, Yaacov; Cummings, Kelli Dawn; Biancarosa, Gina; Fien, Hank – Assessment for Effective Intervention, 2013
The purpose of this article is to provide a commentary on the current state of several measurement issues pertaining to curriculum-based measures of reading (R-CBM). We begin by providing an overview of the utility of R-CBM, followed by a presentation of five specific measurements considerations: (a) the reliability of R-CBM oral reading fluency…
Descriptors: Measurement, Reading Fluency, Curriculum Based Assessment, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…
Descriptors: Competence, Tests, Evaluation Methods, Adults
Peer reviewed Peer reviewed
Direct linkDirect link
Quiroz, Waldo; Rubilar, Cristian Merino – Chemistry Education Research and Practice, 2015
This study develops a tool to identify errors in the presentation of natural laws based on the epistemology and ontology of the Scientific Realism of Mario Bunge. The tool is able to identify errors of different types: (1) epistemological, in which the law is incorrectly presented as data correlation instead of as a pattern of causality; (2)…
Descriptors: Chemistry, Scientific Concepts, Scientific Principles, Error Patterns
Peer reviewed Peer reviewed
Direct linkDirect link
Halpin, Peter F.; Kieffer, Michael J. – Educational Researcher, 2015
The authors outline the application of latent class analysis (LCA) to classroom observational instruments. LCA offers diagnostic information about teachers' instructional strengths and weaknesses, along with estimates of measurement error for individual teachers, while remaining relatively straightforward to implement and interpret. It is…
Descriptors: Multivariate Analysis, Classroom Observation Techniques, Data Analysis, Error of Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Halpin, Peter F.; Kieffer, Michael J. – Grantee Submission, 2015
The authors outline the application of latent class analysis (LCA) to classroom observational instruments. LCA offers diagnostic information about teachers' instructional strengths and weaknesses, along with estimates of measurement error for individual teachers, while remaining relatively straightforward to implement and interpret. It is…
Descriptors: Multivariate Analysis, Classroom Observation Techniques, Data Analysis, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Drake, Michael – Australian Primary Mathematics Classroom, 2014
Ever wondered why children have difficulty using a ruler? In this article Michael Drake investigates some of the difficulties students encounter and provides some ideas for teaching about and learning to use rulers.
Descriptors: Teaching Methods, Mathematics Instruction, Educational Technology, Investigations
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Prieto, Gerardo; Nieto, Eloísa – Psicologica: International Journal of Methodology and Experimental Psychology, 2014
This paper describes how a Many Faceted Rasch Measurement (MFRM) approach can be applied to performance assessment focusing on rater analysis. The article provides an introduction to MFRM, a description of MFRM analysis procedures, and an example to illustrate how to examine the effects of various sources of variability on test takers' performance…
Descriptors: Item Response Theory, Interrater Reliability, Rating Scales, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Michaelides, Michalis P.; Haertel, Edward H. – Applied Measurement in Education, 2014
The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…
Descriptors: Equated Scores, Test Items, Sampling, Statistical Inference
Peer reviewed Peer reviewed
Direct linkDirect link
Jia, Fan; Moore, E. Whitney G.; Kinai, Richard; Crowe, Kelly S.; Schoemann, Alexander M.; Little, Todd D. – International Journal of Behavioral Development, 2014
Utilizing planned missing data (PMD) designs (ex. 3-form surveys) enables researchers to ask participants fewer questions during the data collection process. An important question, however, is just how few participants are needed to effectively employ planned missing data designs in research studies. This article explores this question by using…
Descriptors: Data Analysis, Statistical Inference, Error of Measurement, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Lockwood, J. R.; McCaffrey, Daniel F. – Journal of Educational and Behavioral Statistics, 2014
A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…
Descriptors: Error of Measurement, Scores, Statistical Analysis, Computation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Liu, Sha; Kunnan, Antony John – CALICO Journal, 2016
This study investigated the application of "WriteToLearn" on Chinese undergraduate English majors' essays in terms of its scoring ability and the accuracy of its error feedback. Participants were 163 second-year English majors from a university located in Sichuan province who wrote 326 essays from two writing prompts. Each paper was…
Descriptors: Foreign Countries, Undergraduate Students, English (Second Language), Second Language Learning
Pages: 1  |  ...  |  72  |  73  |  74  |  75  |  76  |  77  |  78  |  79  |  80  |  ...  |  222