Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Oranje, Andreas; Li, Deping; Kandathil, Mathew – ETS Research Report Series, 2009
Several complex sample standard error estimators based on linearization and resampling for the latent regression model of the National Assessment of Educational Progress (NAEP) are studied with respect to design choices such as number of items, number of regressors, and the efficiency of the sample. This paper provides an evaluation of the extent…
Descriptors: Error of Measurement, Computation, Regression (Statistics), National Competency Tests
Pullin, Andrew S.; Knight, Teri M. – New Directions for Evaluation, 2009
To use environmental program evaluation to increase effectiveness, predictive power, and resource allocation efficiency, evaluators need good data. Data require sufficient credibility in terms of fitness for purpose and quality to develop the necessary evidence base. The authors examine elements of data credibility using experience from critical…
Descriptors: Data, Credibility, Conservation (Environment), Program Evaluation
Okada, Kensuke; Shigemasu, Kazuo – Applied Psychological Measurement, 2009
Bayesian multidimensional scaling (MDS) has attracted a great deal of attention because: (1) it provides a better fit than do classical MDS and ALSCAL; (2) it provides estimation errors of the distances; and (3) the Bayesian dimension selection criterion, MDSIC, provides a direct indication of optimal dimensionality. However, Bayesian MDS is not…
Descriptors: Bayesian Statistics, Multidimensional Scaling, Computation, Computer Software
Bai, Yun; Poon, Wai-Yin – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Two-level data sets are frequently encountered in social and behavioral science research. They arise when observations are drawn from a known hierarchical structure, such as when individuals are randomly drawn from groups that are randomly drawn from a target population. Although 2-level data analysis in the context of structural equation modeling…
Descriptors: Structural Equation Models, Data Analysis, Simulation, Goodness of Fit
Innes, Richard G. – Journal of School Choice, 2012
This article provides examples of how serious misconceptions can result when only "all student" scores from the National Assessment of Educational Progress (NAEP) are used for simplistic state-to-state comparisons. Suggestions for better treatment are presented. The article also compares Kentucky's eighth grade EXPLORE testing to NAEP…
Descriptors: National Competency Tests, Scoring, Misconceptions, Academic Achievement
Engdahl, Ryan M.; Elhai, Jon D.; Richardson, J. Don; Frueh, B. Christopher – Psychological Assessment, 2011
We tested two empirically validated 4-factor models of posttraumatic stress disorder (PTSD) symptoms using the PTSD Checklist: King, Leskin, King, and Weathers' (1998) model including reexperiencing, avoidance, emotional numbing, and hyperarousal factors, and Simms, Watson, and Doebbeling's (2002) model including reexperiencing, avoidance,…
Descriptors: Posttraumatic Stress Disorder, Mental Disorders, Factor Structure, Factor Analysis
He, Qingping; Boyle, Andrew; Opposs, Dennis – Evaluation & Research in Education, 2011
Building on findings from existing qualitative research into public perceptions of reliability in examination results in England, a questionnaire was developed and administered to samples of teachers, students and employers to study their awareness of and opinions about various aspects of reliability quantitatively. Main findings from the study…
Descriptors: Qualitative Research, Student Evaluation, Tests, Program Effectiveness
Hill, Heather D.; Morris, Pamela A.; Castells, Nina; Walker, Jessica Thornton – Journal of Policy Analysis and Management, 2011
This study uses data from an experimental employment program and instrumental variables (IV) estimation to examine the effects of maternal job loss on child classroom behavior. Random assignment to the treatment at one of three program sites is an exogenous predictor of employment patterns. Cross-site variation in treatment-control differences is…
Descriptors: Student Behavior, Employment Level, Social Behavior, Employment Programs
Kwon, Hyungil Harry; Pyun, Do Young; Han, Siwan; Ogasawara, Etsuko – Asia Pacific Journal of Education, 2011
The objective of this study was to provide empirical evidence to support psychometric properties of a modified four-dimensional model of the Leadership Scale for Sports (LSS). The study tested invariance of all parameters (i.e., factor loadings, error variances, and factor variances-covariances) in the four-dimensional measurement model between…
Descriptors: Feedback (Response), Testing, Athletes, Factor Structure
Haberman, Shelby J. – ETS Research Report Series, 2008
The reliability of a scaled score can be computed by use of item response theory. Estimated reliability can be obtained even if the item response model selected is not valid.
Descriptors: Reliability, Scores, Item Response Theory, Computation
Brandt, Lorilynn – ProQuest LLC, 2010
Phonics was identified as one of the critical components in reading development by the National Reading Panel. Over time, research has repeatedly identified phonics as important to early reading development. Given the compelling evidence supporting the teaching of phonics in early reading, it is critical to make sure that instructional decisions…
Descriptors: Generalizability Theory, Phonics, Early Reading, Validity
Fletcher, Jack M.; Stuebing, Karla K.; Hughes, Lisa C. – Journal of Psychoeducational Assessment, 2010
IQ test scores should be corrected for high stakes decisions that employ these assessments, including capital offense cases. If scores are not corrected, then diagnostic standards must change with each generation. Arguments against corrections, based on standards of practice, information present and absent in test manuals, and related issues,…
Descriptors: Testing, Mental Retardation, Validity, Intelligence Quotient
Randall, Jennifer; Engelhard, George, Jr. – Applied Measurement in Education, 2010
The psychometric properties and multigroup measurement invariance of scores across subgroups, items, and persons on the "Reading for Meaning" items from the Georgia Criterion Referenced Competency Test (CRCT) were assessed in a sample of 778 seventh-grade students. Specifically, we sought to determine the extent to which score-based…
Descriptors: Testing Accommodations, Test Items, Learning Disabilities, Factor Analysis
Zajonc, Tristan – ProQuest LLC, 2012
Effective policymaking requires understanding the causal effects of competing proposals. Relevant causal quantities include proposals' expected effect on different groups of recipients, the impact of policies over time, the potential trade-offs between competing objectives, and, ultimately, the optimal policy. This dissertation studies causal…
Descriptors: Public Policy, Policy Formation, Bayesian Statistics, Economic Development
Hughes, Gail D. – Research in the Schools, 2009
The impacts of incorrect responses to reverse-coded survey items were examined in this simulation study by reversing responses to traditional Likert-format items from 700 administrators in randomly selected schools in a 7-county region in central Arkansas that were obtained from an archival dataset. Specifically, the number of reverse-coded items…
Descriptors: Surveys, Coding, Context Effect, Measures (Individuals)