NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 21 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Peer reviewed Peer reviewed
Direct linkDirect link
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…
Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kibret, Berhanu Abera – Educational Research and Reviews, 2017
This paper discusses reasons why manuscripts are not accepted for publication in "Ethiopian Journal of Education" ("EJE"). It intends to promote publication by domestic and/or international authors in "EJE" by analyzing the reasons for rejection of manuscripts. To gather the relevant data, a total of 101 rejected…
Descriptors: Foreign Countries, Periodicals, Journal Articles, Writing for Publication
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Yalcin, Seher – Eurasian Journal of Educational Research, 2018
Purpose: Studies in the literature have generally demonstrated that the causes of differential item functioning (DIF) are complex and not directly related to defined groups. The purpose of this study is to determine the DIF according to the mixture item response theory (MixIRT) model, based on the latent group approach, as well as the…
Descriptors: Item Response Theory, Test Items, Test Bias, Error of Measurement
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016
Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…
Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Alper, Paul – Higher Education Review, 2014
In 1916 Robert Frost published his famous poem, "The Road Not Taken," in which he muses about what might have been had he chosen a different path, made a different choice. While counterfactual arguments in general can often lead to vacuous nowheres, frequently in statistics the data that are not presented actually exist, in a sense,…
Descriptors: Data Interpretation, Data Analysis, Error of Measurement, Theory Practice Relationship
Peer reviewed Peer reviewed
Direct linkDirect link
Pampaka, Maria; Hutcheson, Graeme; Williams, Julian – International Journal of Research & Method in Education, 2016
Missing data is endemic in much educational research. However, practices such as step-wise regression common in the educational research literature have been shown to be dangerous when significant data are missing, and multiple imputation (MI) is generally recommended by statisticians. In this paper, we provide a review of these advances and their…
Descriptors: Data Analysis, Statistical Inference, Error of Measurement, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Deygers, Bart; Van Gorp, Koen – Language Testing, 2015
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Descriptors: Rating Scales, Scoring, Validity, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…
Descriptors: Competence, Tests, Evaluation Methods, Adults
Peer reviewed Peer reviewed
Direct linkDirect link
Browne, Dillon T.; Leckie, George; Prime, Heather; Perlman, Michal; Jenkins, Jennifer M. – Developmental Psychology, 2016
The present study sought to investigate the family, individual, and dyad-specific contributions to observed cognitive sensitivity during family interactions. Moreover, the influence of cumulative risk on sensitivity at the aforementioned levels of the family was examined. Mothers and 2 children per family were observed interacting in a round robin…
Descriptors: Family Relationship, Family (Sociological Unit), Sibling Relationship, Siblings
Peer reviewed Peer reviewed
Direct linkDirect link
Zhuang, Jie; Chen, Peijie; Wang, Chao; Jin, Jing; Zhu, Zheng; Zhang, Wenjie – Research Quarterly for Exercise and Sport, 2013
Purpose: The purpose of this study was to determine which method, individual information-centered (IIC) or group information-centered (GIC), is more efficient in recovering missing physical activity (PA) data. Method: A total of 2,758 Chinese children and youth aged 9 to 17 years old (1,438 boys and 1,320 girls) wore ActiGraph GT3X/GT3X+…
Descriptors: Foreign Countries, Physical Activities, Measurement Equipment, Data Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Zhuang, Jie; Chen, Peijie; Wang, Chao; Huang, Liang; Zhu, Zheng; Zhang, Wenjie; Fan, Xiang – Research Quarterly for Exercise and Sport, 2013
Purpose: The purpose of this study was to investigate the characteristics of missing physical activity (PA) data of children and youth. Method: PA data from the Chinese City Children and Youth Physical Activity Study ("N" = 2,758; 1,438 boys and 1,320 girls; aged 9-17 years old) were used for the study. After the data were sorted by the…
Descriptors: Physical Activities, Error of Measurement, Statistical Data, Gender Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Bouhlila, Donia Smaali; Sellaouti, Fethi – Large-scale Assessments in Education, 2013
In this paper, we document a study that involved applying a multiple imputation technique with chained equations to data drawn from the 2007 iteration of the TIMSS database. More precisely, we imputed missing variables contained in the student background datafile for Tunisia (one of the TIMSS 2007 participating countries), by using Van Buuren,…
Descriptors: Databases, Student Characteristics, Error of Measurement, Intervals
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, C. Matthew; Gorelick, Mark – Measurement in Physical Education and Exercise Science, 2011
The purpose of this study was to examine the validity of the Smarthealth watch (Salutron, Inc., Fremont, California, USA), a heart rate monitor that includes a wristwatch without an accompanying chest strap. Twenty-five individuals participated in 3-min periods of standing, 2.0 mph walking, 3.5 mph walking, 4.5 mph jogging, and 6.0 mph running.…
Descriptors: Metabolism, Intervals, Physical Activities, Validity
Previous Page | Next Page »
Pages: 1  |  2