Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Solano-Flores, Guillermo; Backhoff, Eduardo; Contreras-Nino, Luis Angel – International Journal of Testing, 2009
In this article, we present a theory of test translation whose intent is to provide the conceptual foundation for effective, systematic work in the process of test translation and test translation review. According to the theory, translation error is multidimensional; it is not simply the consequence of defective translation but an inevitable fact…
Descriptors: Test Items, Investigations, Semantics, Translation
Chafouleas, Sandra M.; Christ, Theodore J.; Riley-Tillman, T. Chris – Educational and Psychological Measurement, 2009
Generalizability theory is used to examine the impact of scaling gradients on a single-item Direct Behavior Rating (DBR). A DBR refers to a type of rating scale used to efficiently record target behavior(s) following an observation occasion. Variance components associated with scale gradients are estimated using a random effects design for persons…
Descriptors: Generalizability Theory, Undergraduate Students, Scaling, Rating Scales
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size
Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2012
This report describes the value-added models used as part of teacher evaluation systems in the District of Columbia Public Schools (DCPS) and in eligible DC charter schools participating in "Race to the Top." The authors estimated: (1) teacher effectiveness in DCPS and eligible DC charter schools during the 2011-2012 school year; and (2)…
Descriptors: Teacher Evaluation, Value Added Models, Public Schools, Charter Schools
Radford, Alexandria Walton; Horn, Laura – National Center for Education Statistics, 2012
These Web Tables provide an overview of classes taken and credits earned by a nationwide sample of first-time beginning postsecondary students based on data from the Postsecondary Education Transcript Study (PETS) of the 2004/09 Beginning Postsecondary Students Longitudinal Study. PETS collected transcripts from all the postsecondary institutions…
Descriptors: Postsecondary Education, College Freshmen, Academic Records, Longitudinal Studies
Elosua, Paula; Iliescu, Dragos – International Journal of Testing, 2012
Psychometric practice does not always converge with the advances of psychometric theory. In order to investigate this gap, the authors focus on the 10 most used psychological tests in Europe, as identified by recent surveys. The article analyzes test manuals published in 6 different European countries for these 10 most used tests. A total of 32…
Descriptors: Psychological Testing, Personality Measures, Error of Measurement, Foreign Countries
Tourangeau, Karen; Nord, Christine; Lê, Thanh; Wallner-Allen, Kathleen; Vaden-Kiernan, Nancy; Blaker, Lisa; Najarian, Michelle – National Center for Education Statistics, 2018
This manual provides guidance and documentation for users of the longitudinal kindergarten-fourth grade (K-4) public-use data file of the Early Childhood Longitudinal Study, Kindergarten Class of 2010-11 (ECLS-K:2011), which includes the first release of the public version of the third-grade data. This manual mainly provides information specific…
Descriptors: Longitudinal Studies, Children, Surveys, Kindergarten
National Center for Education Statistics, 2010
This paper presents the supplemental figures, tables, and standard error tables for the report "Student Financing of Undergraduate Education: 2007-08. Web Tables. NCES 2010-162." (Contains 6 figures and 10 tables.) [For the main report, see ED511828.]
Descriptors: Undergraduate Study, Higher Education, Undergraduate Students, Tables (Data)
Volkwein, J. Fredericks; Yin, Alexander C. – New Directions for Institutional Research, 2010
This chapter summarizes ten selected issues and common problems that arise in most assessment research projects. These include: (1) the uses of grades in assessment; (2) institutional review boards; (3) research design as a compromise; (4) standardized testing; (5) self-reported measures; (6) missing data; (7) weighting data; (8) conditional…
Descriptors: Research Design, Research Methodology, Standardized Tests, Least Squares Statistics
Bubany, Shawn T.; Hansen, Jo-Ida C. – Measurement and Evaluation in Counseling and Development, 2010
Conceptual differences between self-efficacy and ability self-estimate scores, used in vocational psychology and career counseling, were examined with confirmatory factor analysis, discriminate relations, and reliability analysis. Results suggest that empirical differences may be due to measurement error or scale content, rather than due to the…
Descriptors: Self Efficacy, Measurement, Factor Analysis, Error of Measurement
Karcher, Michael J.; Sass, Daniel – Journal of Counseling Psychology, 2010
Counselors, psychologists, and evaluators of intervention programs for youth increasingly view the promotion of connectedness as an important intervention outcome. When evaluating these programs, researchers frequently test whether the treatment effects differ across gender and ethnic or racial groups. Doing so necessitates the availability of…
Descriptors: African Americans, Neighborhoods, Ethnicity, Race
Krishnakumar, Jaya; Nagar, A. L. – Social Indicators Research, 2008
Recent empirical literature has seen many multidimensional indices emerge as well-being or poverty measures, in particular indices derived from principal components and various latent variable models. Though such indices are being increasingly and widely employed, few studies motivate their use or report the standard errors or confidence intervals…
Descriptors: Intervals, Structural Equation Models, Factor Analysis, Computation
Bonnett, Douglas G. – Psychological Methods, 2008
Most psychology journals now require authors to report a sample value of effect size along with hypothesis testing results. The sample effect size value can be misleading because it contains sampling error. Authors often incorrectly interpret the sample effect size as if it were the population effect size. A simple solution to this problem is to…
Descriptors: Intervals, Hypothesis Testing, Effect Size, Sampling
Savalei, Victoria; Kolenikov, Stanislav – Psychological Methods, 2008
Recently, R. D. Stoel, F. G. Garre, C. Dolan, and G. van den Wittenboer (2006) reviewed approaches for obtaining reference mixture distributions for difference tests when a parameter is on the boundary. The authors of the present study argue that this methodology is incomplete without a discussion of when the mixtures are needed and show that they…
Descriptors: Structural Equation Models, Goodness of Fit, Evaluation Methods, Statistical Analysis
Jackson, Margaret C.; Raymond, Jane E. – Journal of Experimental Psychology: Human Perception and Performance, 2008
Although it is intuitive that familiarity with complex visual objects should aid their preservation in visual working memory (WM), empirical evidence for this is lacking. This study used a conventional change-detection procedure to assess visual WM for unfamiliar and famous faces in healthy adults. Across experiments, faces were upright or…
Descriptors: Familiarity, Long Term Memory, Short Term Memory, Stimuli