NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Does not meet standards1
Showing 511 to 525 of 3,295 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2018
Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…
Descriptors: Weighted Scores, Error of Measurement, Test Use, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Luecht, Richard; Ackerman, Terry A. – Educational Measurement: Issues and Practice, 2018
Simulation studies are extremely common in the item response theory (IRT) research literature. This article presents a didactic discussion of "truth" and "error" in IRT-based simulation studies. We ultimately recommend that future research focus less on the simple recovery of parameters from a convenient generating IRT model,…
Descriptors: Item Response Theory, Simulation, Ethics, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sahin, Melek Gulsah – International Journal of Assessment Tools in Education, 2020
Computer Adaptive Multistage Testing (ca-MST), which take the advantage of computer technology and adaptive test form, are widely used, and are now a popular issue of assessment and evaluation. This study aims at analyzing the effect of different panel designs, module lengths, and different sequence of a parameter value across stages and change in…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Response Theory
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kara, Hakan; Cetin, Sevda – International Journal of Assessment Tools in Education, 2020
In this study, the efficiency of various random sampling methods to reduce the number of items rated by judges in an Angoff standard-setting study was examined and the methods were compared with each other. Firstly, the full-length test was formed by combining Placement Test 2012 and 2013 mathematics subsets. After then, simple random sampling…
Descriptors: Cutting Scores, Standard Setting (Scoring), Sampling, Error of Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Duprey, Michael A.; Pratt, Daniel J.; Wilson, David H.; Jewell, Donna M.; Brown, Derick S.; Caves, Lesa R.; Kinney, Satkartar K.; Mattox, Tiffany L.; Ritchie, Nichole Smith; Rogers, James E.; Spagnardi, Colleen M.; Wescott, Jamie D. – National Center for Education Statistics, 2020
The nine appendices in this publication accompany the full report, "High School Longitudinal Study of 2009 (HSLS:09) Postsecondary Education Transcript Study and Student Financial Aid Records Collection. Data File Documentation. NCES 2020-004" (ED607366). They include: (1) Glossary of Terms; (2) Student Financial Aid Records Instrument…
Descriptors: Longitudinal Studies, High School Students, Data Collection, Academic Records
Rachel A. Gross – ProQuest LLC, 2020
The present study was motivated by the theory-method mismatch between heterotypic continuity (aspects of development that manifest differently across the lifespan thus cannot be measured the same way over time) and longitudinal measurement equivalence (the statistical assumption that the developmental phenomenon studied is measured on the same…
Descriptors: Robustness (Statistics), Structural Equation Models, Longitudinal Studies, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Korsun, Igor; Kryzhanovskyi, Serhii; Monchuk, Maryna – Physics Education, 2019
Medical physics uses physics knowledge in medicine or healthcare. The question surrounding the methods of measuring temperature is important in medicine. The aim of this article is to explore the possibilities of using Microsoft Excel to study thermometers. The physical concepts and laws related to temperature measurement have been considered. The…
Descriptors: Physics, Measurement Equipment, Spreadsheets, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Hayes, Timothy – Journal of Educational and Behavioral Statistics, 2019
Multiple imputation is a popular method for addressing data that are presumed to be missing at random. To obtain accurate results, one's imputation model must be congenial to (appropriate for) one's intended analysis model. This article reviews and demonstrates two recent software packages, Blimp and jomo, to multiply impute data in a manner…
Descriptors: Computer Software Evaluation, Computer Software Reviews, Hierarchical Linear Modeling, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019
Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…
Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Leite, Walter L.; Aydin, Burak; Gurel, Sungur – Journal of Experimental Education, 2019
This Monte Carlo simulation study compares methods to estimate the effects of programs with multiple versions when assignment of individuals to program version is not random. These methods use generalized propensity scores, which are predicted probabilities of receiving a particular level of the treatment conditional on covariates, to remove…
Descriptors: Probability, Weighted Scores, Monte Carlo Methods, Statistical Bias
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Astivia, Oscar L. Olvera; Zumbo, Bruno D. – Practical Assessment, Research & Evaluation, 2019
Within psychology and the social sciences, Ordinary Least Squares (OLS) regression is one of the most popular techniques for data analysis. In order to ensure the inferences from the use of this method are appropriate, several assumptions must be satisfied, including the one of constant error variance (i.e. homoskedasticity). Most of the training…
Descriptors: Multiple Regression Analysis, Least Squares Statistics, Statistical Analysis, Error of Measurement
Xu, Jie – ProQuest LLC, 2019
Research has shown that cross-sectional mediation analysis cannot accurately reflect a true longitudinal mediated effect. To investigate longitudinal mediated effects, different longitudinal mediation models have been proposed and these models focus on different research questions related to longitudinal mediation. When fitting mediation models to…
Descriptors: Case Studies, Error of Measurement, Longitudinal Studies, Models
Wang, Chun; Chen, Ping; Jiang, Shengyu – Grantee Submission, 2019
Many large-scale educational surveys have moved from linear form design to multistage testing (MST) design. One advantage of MST is that it can provide more accurate latent trait [theta] estimates using fewer items than required by linear tests. However, MST generates incomplete response data by design; hence questions remain as to how to…
Descriptors: Adaptive Testing, Test Items, Item Response Theory, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Jing Sun; Laura M. Justice; Ye Shen; Hui Jiang; Hugo Gonzalez Villasanti; Mary Beth Schmitt – Grantee Submission, 2024
Purpose: The purpose of this study was to examine the measurement structure of the linguistic features of speech-language pathologists' (SLPs) talk during business-as-usual therapy sessions in the public schools, and to test the longitudinal stability of a theorized dimensional structure consisting of quantity, grammatical complexity, and lexical…
Descriptors: Speech Language Pathology, Allied Health Personnel, Speech Therapy, Longitudinal Studies
Pages: 1  |  ...  |  31  |  32  |  33  |  34  |  35  |  36  |  37  |  38  |  39  |  ...  |  220