Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
VanHoudnos, Nathan M.; Greenhouse, Joel B. – Journal of Educational and Behavioral Statistics, 2016
When cluster randomized experiments are analyzed as if units were independent, test statistics for treatment effects can be anticonservative. Hedges proposed a correction for such tests by scaling them to control their Type I error rate. This article generalizes the Hedges correction from a posttest-only experimental design to more common designs…
Descriptors: Statistical Analysis, Randomized Controlled Trials, Error of Measurement, Scaling
Li, Tongyun; Jiao, Hong; Macready, George B. – Educational and Psychological Measurement, 2016
The present study investigates different approaches to adding covariates and the impact in fitting mixture item response theory models. Mixture item response theory models serve as an important methodology for tackling several psychometric issues in test development, including the detection of latent differential item functioning. A Monte Carlo…
Descriptors: Item Response Theory, Psychometrics, Test Construction, Monte Carlo Methods
Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015
The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…
Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items
Ballou, Dale; Springer, Matthew G. – Educational Researcher, 2015
Our aim in this article is to draw attention to some underappreciated problems in the design and implementation of evaluation systems that incorporate value-added measures. We focus on four: (1) taking into account measurement error in teacher assessments, (2) revising teachers' scores as more information becomes available about their students,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Scores, Error of Measurement
Lockwood, J. R.; Castellano, Katherine E. – Grantee Submission, 2015
This article suggests two alternative statistical approaches for estimating student growth percentiles (SGP). The first is to estimate percentile ranks of current test scores conditional on past test scores directly, by modeling the conditional cumulative distribution functions, rather than indirectly through quantile regressions. This would…
Descriptors: Statistical Analysis, Achievement Gains, Academic Achievement, Computation
McNeish, Daniel – Review of Educational Research, 2017
In education research, small samples are common because of financial limitations, logistical challenges, or exploratory studies. With small samples, statistical principles on which researchers rely do not hold, leading to trust issues with model estimates and possible replication issues when scaling up. Researchers are generally aware of such…
Descriptors: Models, Statistical Analysis, Sampling, Sample Size
Schweig, Jonathan David – Applied Measurement in Education, 2014
Developing indicators that reflect important aspects of school and classroom environments has become central in a nationwide effort to develop comprehensive programs that measure teacher quality and effectiveness. Formulating teacher evaluation policy necessitates accurate and reliable methods for measuring these environmental variables. This…
Descriptors: Error of Measurement, Educational Environment, Classroom Environment, Surveys
Dodge, Nadine; Chapman, Ralph – International Journal of Social Research Methodology, 2018
Electronically assisted survey techniques offer several advantages over traditional survey techniques. However, they can also potentially introduce biases, such as coverage biases and measurement error. The current study compares the relative merits of two survey distribution and completion modes: email recruitment with internet completion; and…
Descriptors: Online Surveys, Handheld Devices, Bias, Electronic Mail
Sekercioglu, Güçlü; Kogar, Hakan – Novitas-ROYAL (Research on Youth and Language), 2018
The aim of the present study was to examine the measurement invariance (MI) of the reading, mathematics, and science tests in terms of the commonly used languages. It also aimed to examine the differential item functioning (DIF) of the PISA test, the original items of which are in the languages of English and French, in terms of the language…
Descriptors: Error of Measurement, Item Response Theory, International Assessment, Achievement Tests
Kibret, Berhanu Abera – Educational Research and Reviews, 2017
This paper discusses reasons why manuscripts are not accepted for publication in "Ethiopian Journal of Education" ("EJE"). It intends to promote publication by domestic and/or international authors in "EJE" by analyzing the reasons for rejection of manuscripts. To gather the relevant data, a total of 101 rejected…
Descriptors: Foreign Countries, Periodicals, Journal Articles, Writing for Publication
Leckie, George; Goldstein, Harvey – British Educational Research Journal, 2017
Since 1992, the UK Government has published so-called "school league tables" summarising the average General Certificate of Secondary Education (GCSE) "attainment" and "progress" made by pupils in each state-funded secondary school in England. While the headline measure of school attainment has remained the percentage…
Descriptors: Foreign Countries, Achievement Rating, Academic Achievement, Secondary School Students
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Vaughan, Robert; Laborde, Sylvain – Measurement in Physical Education and Exercise Science, 2018
The purpose of this study was to assess the psychometrics properties of the Emotional Intelligence Scale and assess the measurement invariance across elite (n = 367), amateur (n = 629), and non-athletes (n = 550). In total, 1,546 participants from various sports completed the emotional intelligence scale. Several competing models were compared…
Descriptors: Psychometrics, Emotional Intelligence, Measures (Individuals), Athletes
Vaughan, Timothy S. – Journal of Statistics Education, 2015
This paper introduces a dataset and associated analysis of the scores of National Football League (NFL) games over the 2012, 2013, and first five weeks of the 2014 season. In the face of current media attention to "lopsided" scores in Thursday night games in the early part of the 2014 season, t-test results indicate no statistically…
Descriptors: Team Sports, Success, Scores, Statistics
Can, Seda; van de Schoot, Rens; Hox, Joop – Educational and Psychological Measurement, 2015
Because variables may be correlated in the social and behavioral sciences, multicollinearity might be problematic. This study investigates the effect of collinearity manipulated in within and between levels of a two-level confirmatory factor analysis by Monte Carlo simulation. Furthermore, the influence of the size of the intraclass correlation…
Descriptors: Factor Analysis, Comparative Analysis, Maximum Likelihood Statistics, Bayesian Statistics