NotesFAQContact Us
Collection
Advanced
Search Tips
Assessments and Surveys
What Works Clearinghouse Rating
Showing 1 to 15 of 52 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Peer reviewed Peer reviewed
Direct linkDirect link
McNeish, Daniel; Harring, Jeffrey R. – Educational and Psychological Measurement, 2017
To date, small sample problems with latent growth models (LGMs) have not received the amount of attention in the literature as related mixed-effect models (MEMs). Although many models can be interchangeably framed as a LGM or a MEM, LGMs uniquely provide criteria to assess global data-model fit. However, previous studies have demonstrated poor…
Descriptors: Growth Models, Goodness of Fit, Error Correction, Sampling
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Wedman, Jonathan; Lyrén, Per-Erik – Practical Assessment, Research & Evaluation, 2015
When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…
Descriptors: Evaluation Methods, Psychometrics, Scores, Tests
MacHardy, Zachary; Pardos, Zachary A. – International Educational Data Mining Society, 2015
Along with the advent of MOOCs and other online learning platforms such as Khan Academy, the role of online education has continued to grow in relation to that of traditional on-campus instruction. Rather than tackle the problem of evaluating large educational units such as entire online courses, this paper approaches a smaller problem: exploring…
Descriptors: Educational Technology, Video Technology, Units of Study, Multimedia Materials
Peer reviewed Peer reviewed
Direct linkDirect link
Khan, R. Nazim – International Journal of Mathematical Education in Science and Technology, 2015
Open book assessment is not a new idea, but it does not seem to have gained ground in higher education. In particular, not much literature is available on open book examinations in mathematics and statistics in higher education. The objective of this paper is to investigate the appropriateness of open book assessments in a first-year business…
Descriptors: Evaluation Methods, Higher Education, Mathematics Tests, Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Liang, Guodong; Akiba, Motoko – Educational Policy, 2015
Using statewide longitudinal teacher survey data collected in 2009 and 2010, this study examined the characteristics of teacher evaluation used to determine performance-related pay (PRP), and the association between PRP and improvement in the practice of constructivist instruction. The study found that 10.9% of middle school mathematics teachers…
Descriptors: Constructivism (Learning), Teacher Evaluation, Merit Pay, Longitudinal Studies
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gill, Brian; Shoji, Megan; Coen, Thomas; Place, Kate – Regional Educational Laboratory Mid-Atlantic, 2016
School districts and states across the Regional Educational Laboratory Mid-Atlantic Region and the country as a whole have been modifying their teacher evaluation systems to identify more effective and less effective teachers and provide better feedback to improve instructional practice. The new systems typically include components related to…
Descriptors: Predictive Validity, Test Bias, Test Content, School Districts
Peer reviewed Peer reviewed
Direct linkDirect link
Reichardt, Charles S. – Multivariate Behavioral Research, 2011
Maxwell, Cole, and Mitchell (2011) demonstrated that simple structural equation models, when used with cross-sectional data, generally produce biased estimates of meditated effects. I extend those results by showing how simple structural equation models can produce biased estimates of meditated effects when used even with longitudinal data. Even…
Descriptors: Structural Equation Models, Statistical Data, Longitudinal Studies, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Gunn, Alexandra C. – Journal of Early Childhood Research, 2011
Heteronormativity, the concept that heterosexual sexuality is an institutionalized norm and a superior and privileged standard, is held firm when discourses of gender, sexualities and family form converge. In a study of heteronormative discourses in the context of early childhood education, teachers shared accounts of practices where genders,…
Descriptors: Early Childhood Education, Sexual Orientation, Young Children, Statistical Data
Peer reviewed Peer reviewed
Direct linkDirect link
Vanbelle, Sophie; Albert, Adelin – Psychometrika, 2009
We propose a coefficient of agreement to assess the degree of concordance between two independent groups of raters classifying items on a nominal scale. This coefficient, defined on a population-based model, extends the classical Cohen's kappa coefficient for quantifying agreement between two raters. Weighted and intraclass versions of the…
Descriptors: Interrater Reliability, Weighted Scores, Congruence (Psychology), Rating Scales
Peer reviewed Peer reviewed
Direct linkDirect link
Warren, John Robert; Saliba, Jim – Educational Researcher, 2012
How many students repeat a grade each year? How do retention rates vary across states and over time? Despite extensive research on the predictors and consequences of grade retention, there is no systematic way to quantify state-level retention rates; even national estimates rely on imperfect proxy measures. We present a conceptually simple…
Descriptors: Grade Repetition, School Holding Power, Public Education, National Surveys
Peer reviewed Peer reviewed
Direct linkDirect link
Vera-Toscano, Esperanza; Ateca-Amestoy, Victoria – Social Indicators Research, 2008
For most individuals, housing is the largest consumption and investment item of their lifetime and, as a result, housing satisfaction is an important component of their quality of life. The purpose of this paper then is to investigate the determinants of individual housing satisfaction as a particular domain of satisfaction with life as a whole,…
Descriptors: Life Satisfaction, Quality of Life, Housing, Statistical Data
Peer reviewed Peer reviewed
Direct linkDirect link
Dekle, Dawn J.; Leung, Denis H. Y.; Zhu, Min – Psychological Methods, 2008
Across many areas of psychology, concordance is commonly used to measure the (intragroup) agreement in ranking a number of items by a group of judges. Sometimes, however, the judges come from multiple groups, and in those situations, the interest is to measure the concordance between groups, under the assumption that there is some within-group…
Descriptors: Item Response Theory, Statistical Analysis, Psychological Studies, Evaluators
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Minott, Mark A.; Young, Allan E. – Australian Journal of Teacher Education, 2009
The main purpose of the study was to ascertain the benefits of employing a hybrid evaluation approach to assessing a teacher education programme's objectives or intended outcomes. The benefits of employing the hybrid evaluation approach enacted through its evaluation survey component was seen in the fact that it acts as a guide for participants'…
Descriptors: Program Evaluation, Statistical Data, Foreign Countries, Journal Writing
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Seonghoon; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2007
Under item response theory, the characteristic curve methods (Haebara and Stocking-Lord methods) are used to link two ability scales from separate calibrations. The linking methods use their respective criterion functions that can be defined differently according to the symmetry- and distribution-related schemes. The symmetry-related scheme…
Descriptors: Measures (Individuals), Item Response Theory, Simulation, Comparative Analysis
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4