Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Wing, Coady; Cook, Thomas D. – Journal of Policy Analysis and Management, 2013
The sharp regression discontinuity design (RDD) has three key weaknesses compared to the randomized clinical trial (RCT). It has lower statistical power, it is more dependent on statistical modeling assumptions, and its treatment effect estimates are limited to the narrow subpopulation of cases immediately around the cutoff, which is rarely of…
Descriptors: Regression (Statistics), Research Design, Statistical Analysis, Research Problems
Zhuang, Jie; Chen, Peijie; Wang, Chao; Jin, Jing; Zhu, Zheng; Zhang, Wenjie – Research Quarterly for Exercise and Sport, 2013
Purpose: The purpose of this study was to determine which method, individual information-centered (IIC) or group information-centered (GIC), is more efficient in recovering missing physical activity (PA) data. Method: A total of 2,758 Chinese children and youth aged 9 to 17 years old (1,438 boys and 1,320 girls) wore ActiGraph GT3X/GT3X+…
Descriptors: Foreign Countries, Physical Activities, Measurement Equipment, Data Analysis
Kim, YoungKoung; Hendrickson, Amy; Patel, Priyank; Melican, Gerald; Sweeney, Kevin – College Board, 2013
The purpose of this report is to describe the procedure for revising the ReadiStep™ score scale using the field trial data, and to provide technical information about the development of the new ReadiStep scale score. In doing so, this report briefly introduces the three assessments--ReadiStep, PSAT/NMSQT®, and SAT®--in the College Board Pathway…
Descriptors: College Entrance Examinations, Educational Assessment, High School Students, Scores
Reporting Data with "Over-the-Counter" Data Analysis Supports Increases Educators' Analysis Accuracy
Rankin, Jenny Grant – Online Submission, 2013
There is extensive research on the benefits of making data-informed decisions to improve learning, but these benefits rely on the data being effectively interpreted. Despite educators' above-average intellect and education levels, there is evidence many educators routinely misinterpret student data. Data analysis problems persist even at districts…
Descriptors: Statistical Data, Data Interpretation, Data Analysis, Error of Measurement
Olivera-Aguilar, Margarita; Millsap, Roger E. – Multivariate Behavioral Research, 2013
A common finding in studies of differential prediction across groups is that although regression slopes are the same or similar across groups, group differences exist in regression intercepts. Building on earlier work by Birnbaum (1979), Millsap (1998) presented an invariant factor model that would explain such intercept differences as arising due…
Descriptors: Statistical Analysis, Measurement, Prediction, Regression (Statistics)
Solano-Flores, Guillermo; Li, Min – Educational Research and Evaluation, 2013
We discuss generalizability (G) theory and the fair and valid assessment of linguistic minorities, especially emergent bilinguals. G theory allows examination of the relationship between score variation and language variation (e.g., variation of proficiency across languages, language modes, and social contexts). Studies examining score variation…
Descriptors: Measurement, Testing, Language Proficiency, Test Construction
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Roncancio, Angelica M.; Ward, Kristy K.; Sanchez, Ingrid A.; Cano, Miguel A.; Byrd, Theresa L.; Vernon, Sally W.; Fernandez-Esquer, Maria Eugenia; Fernandez, Maria E. – Health Education & Behavior, 2015
To reduce the high incidence of cervical cancer among Latinas in the United States it is important to understand factors that predict screening behavior. The aim of this study was to test the utility of theory of planned behavior in predicting cervical cancer screening among a group of Latinas. A sample of Latinas (N = 614) completed a baseline…
Descriptors: Cancer, Screening Tests, Incidence, Hispanic Americans
New York State Education Department, 2016
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Sapp, Marty – Multicultural Learning and Teaching, 2012
Like many journals within education, for "Multicultural Learning and Teaching," this writer found little written on measurement, effect sizes, and confidence intervals; therefore, the purpose of this article is to address these factors. The effect of not addressing these issues is that a basic foundation of science cannot be established…
Descriptors: Multicultural Education, Educational Research, Scholarship, Reliability
Oh, Hyeonjoo; Moses, Tim – Journal of Educational Measurement, 2012
This study investigated differences between two approaches to chained equipercentile (CE) equating (one- and bi-direction CE equating) in nearly equal groups and relatively unequal groups. In one-direction CE equating, the new form is linked to the anchor in one sample of examinees and the anchor is linked to the reference form in the other…
Descriptors: Equated Scores, Statistical Analysis, Comparative Analysis, Differences
Micklewright, John; Schnepf, Sylke V.; Silva, Pedro N. – Economics of Education Review, 2012
Investigation of peer effects on achievement with sample survey data on schools may mean that only a random sample of the population of peers is observed for each individual. This generates measurement error in peer variables similar in form to the textbook case of errors-in-variables, resulting in the estimated peer group effects in an OLS…
Descriptors: Foreign Countries, Sampling, Error of Measurement, Peer Groups
Jiao, Hong; Kamata, Akihito; Wang, Shudong; Jin, Ying – Journal of Educational Measurement, 2012
The applications of item response theory (IRT) models assume local item independence and that examinees are independent of each other. When a representative sample for psychometric analysis is selected using a cluster sampling method in a testlet-based assessment, both local item dependence and local person dependence are likely to be induced.…
Descriptors: Item Response Theory, Test Items, Markov Processes, Monte Carlo Methods
Le, Huy; Marcus, Justin – Educational and Psychological Measurement, 2012
This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…
Descriptors: Monte Carlo Methods, Probability, Mathematical Concepts, Effect Size
McAllister, Daniel; Guidice, Rebecca M. – Teaching in Higher Education, 2012
The primary goal of teaching is to successfully facilitate learning. Testing can help accomplish this goal in two ways. First, testing can provide a powerful motivation for students to prepare when they perceive that the effort involved leads to valued outcomes. Second, testing can provide instructors with valuable feedback on whether their…
Descriptors: Testing, Role, Student Motivation, Feedback (Response)

Peer reviewed
Direct link
