Publication Date
In 2025 | 39 |
Since 2024 | 192 |
Since 2021 (last 5 years) | 495 |
Since 2016 (last 10 years) | 996 |
Since 2006 (last 20 years) | 2028 |
Audience
Researchers | 93 |
Practitioners | 23 |
Teachers | 22 |
Policymakers | 10 |
Administrators | 5 |
Students | 4 |
Counselors | 2 |
Parents | 2 |
Community | 1 |
Location
United States | 47 |
Germany | 42 |
Australia | 34 |
Canada | 27 |
Turkey | 27 |
California | 22 |
United Kingdom (England) | 20 |
Netherlands | 18 |
China | 16 |
New York | 15 |
United Kingdom | 15 |
What Works Clearinghouse Rating
Does not meet standards | 1 |
Kane, Michael T. – ETS Research Report Series, 2017
By aggregating residual gain scores (the differences between each student's current score and a predicted score based on prior performance) for a school or a teacher, value-added models (VAMs) can be used to generate estimates of school or teacher effects. It is known that random errors in the prior scores will introduce bias into predictions of…
Descriptors: Error of Measurement, Value Added Models, Scores, Teacher Effectiveness
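The residual gain idea in the abstract above can be sketched in a few lines. This is a minimal illustration, not the model studied in the report: it assumes a single prior score, a simple least-squares prediction, and an unweighted mean of residuals per group. The function name `residual_gain_effects` and its arguments are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def residual_gain_effects(prior, current, groups):
    """Average residual gain per group (e.g. per teacher or school).

    Assumption: the predicted score comes from a one-predictor
    least-squares fit of current score on prior score.
    """
    mean_prior, mean_current = mean(prior), mean(current)
    sxx = sum((x - mean_prior) ** 2 for x in prior)
    sxy = sum((x - mean_prior) * (y - mean_current)
              for x, y in zip(prior, current))
    slope = sxy / sxx
    intercept = mean_current - slope * mean_prior

    # Residual gain = current score minus predicted score.
    residuals_by_group = defaultdict(list)
    for x, y, g in zip(prior, current, groups):
        residuals_by_group[g].append(y - (intercept + slope * x))

    # Aggregate residual gains into one estimate per group.
    return {g: mean(r) for g, r in residuals_by_group.items()}
```

Because the prediction treats the prior score as error-free, random error in that score carries into the residuals, which is the source of bias the abstract refers to.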
Sinharay, Sandip; Johnson, Matthew S. – Educational and Psychological Measurement, 2017
In a pioneering research article, Wollack and colleagues suggested the "erasure detection index" (EDI) to detect test tampering. The EDI can be used with or without a continuity correction and is assumed to follow the standard normal distribution under the null hypothesis of no test tampering. When used without a continuity correction,…
Descriptors: Deception, Identification, Testing Problems, Error of Measurement
Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017
Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…
Descriptors: Error of Measurement, Test Bias, International Assessment, Computation
Stallasch, Sophie E.; Lüdtke, Oliver; Artelt, Cordula; Brunner, Martin – Journal of Research on Educational Effectiveness, 2021
To plan cluster-randomized trials with sufficient statistical power to detect intervention effects on student achievement, researchers need multilevel design parameters, including measures of between-classroom and between-school differences and the amounts of variance explained by covariates at the student, classroom, and school level. Previous…
Descriptors: Foreign Countries, Randomized Controlled Trials, Intervention, Educational Research
Gilman, Leon J.; Zhang, Bo; Jones, Curtis J. – Learning Environments Research, 2021
Students' perceptions of the learning environment play an important role in their academic achievement and social lives. While most measures of school environment have been developed for middle- and high-school students, they also have been used for younger students, such as 4th and 5th graders. What is unclear is whether these measures are…
Descriptors: Student Attitudes, Educational Environment, Academic Achievement, Social Life
Karakolidis, Anastasios; O'Leary, Michael; Scully, Darina – International Journal of Testing, 2021
The linguistic complexity of many text-based tests can be a source of construct-irrelevant variance, as test-takers' performance may be affected by factors that are beyond the focus of the assessment itself, such as reading comprehension skills. This experimental study examined the extent to which the use of animated videos, as opposed to written…
Descriptors: Animation, Vignettes, Video Technology, Test Format
Elahi Shirvan, Majid; Taherian, Tahereh; Yazdanmehr, Elham – Studies in Second Language Acquisition, 2022
Given the longitudinal nature of L2 grit, the use of conventional research methodologies with cross-sectional data to examine the validity of L2 grit scale seems inadequate. The present research was an attempt to extend the domain-specific phase of research on L2 grit, with the pursuit of long-term goals at its core, into a dynamic one. Thus, we…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Academic Persistence
Gagnon-Bartsch, J. A.; Sales, A. C.; Wu, E.; Botelho, A. F.; Erickson, J. A.; Miratrix, L. W.; Heffernan, N. T. – Grantee Submission, 2019
Randomized controlled trials (RCTs) admit unconfounded design-based inference--randomization largely justifies the assumptions underlying statistical effect estimates--but often have limited sample sizes. However, researchers may have access to big observational data on covariates and outcomes from RCT non-participants. For example, data from A/B…
Descriptors: Randomized Controlled Trials, Educational Research, Prediction, Algorithms
Gibson, C. Ben; Mayhall, Timothy B. – Sociological Methods & Research, 2019
Although a wealth of literature exists studying the effect of sponsor characteristics on self-reports of mental health, little work assesses a related but potentially powerful effect: a context comprehension effect, that is, a change in the respondent's interpretation of a survey question, given the concept elicited by the interviewer. Further,…
Descriptors: Mental Health, Hospitals, Context Effect, Comprehension
Gu, Lixiong; Ling, Guangming; Qu, Yanxuan – ETS Research Report Series, 2019
Research has found that the "a"-stratified item selection strategy (STR) for computerized adaptive tests (CATs) may lead to insufficient use of high-"a" items at later stages of the tests and thus to reduced measurement precision. A refined approach, unequal item selection across strata (USTR), effectively improves test precision over the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Use, Test Items
Parsons, Eric; Koedel, Cory; Tan, Li – Journal of Educational and Behavioral Statistics, 2019
We study the relative performance of two policy-relevant value-added models--a one-step fixed effect model and a two-step aggregated residuals model--using a simulated data set well grounded in the value-added literature. A key feature of our data generating process is that student achievement depends on a continuous measure of economic…
Descriptors: Value Added Models, Economically Disadvantaged, Academic Achievement, Low Income Students
Meyer, Jennifer; Schmidt, Fabian T. C.; Fleckenstein, Johanna; Köller, Olaf – British Journal of Educational Psychology, 2023
Background: Many empirical investigations focus on how personality traits and academic motivation are related to academic achievement. Regarding the personality traits described in the five-factor model, prior research has shown associations between openness to experience and language achievement in particular. Following the principle of trait…
Descriptors: Longitudinal Studies, German, Personality Traits, Student Motivation
Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023
Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies
Nicewander, W. Alan – Educational and Psychological Measurement, 2018
Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…
Descriptors: Error of Measurement, Correlation, Sample Size, Computation
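For reference, the classical form of Spearman's correction divides the observed correlation by the square root of the product of the two reliabilities. The sketch below shows only that textbook formula; the function name `disattenuate` and its parameter names are hypothetical, and it does not reproduce the article's further results.

```python
import math


def disattenuate(r_xy, rel_x, rel_y):
    """Spearman's correction for attenuation.

    r_xy  : observed correlation between X and Y
    rel_x : reliability of X (set to 1.0 to leave X uncorrected)
    rel_y : reliability of Y (set to 1.0 to leave Y uncorrected)
    """
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)
```

For example, `disattenuate(0.36, 0.81, 0.64)` divides 0.36 by 0.72. Passing 1.0 for one reliability corrects for measurement error in the other variable only.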
Yoder, Paul J.; Ledford, Jennifer R.; Harbison, Amy L.; Tapp, Jon T. – Journal of Early Intervention, 2018
A simulation study that used 3,000 computer-generated event streams with known behavior rates, interval durations, and session durations was conducted to test whether the main and interaction effects of true rate and interval duration affect the error level of uncorrected and Poisson-transformed (i.e., "corrected") count as estimated by…
Descriptors: Computation, Child Behavior, Early Childhood Education, Early Intervention
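As background for the Poisson transformation mentioned in the abstract above: if events follow a Poisson process with rate λ, the probability that an interval of duration d contains at least one event is 1 − exp(−λd), so the rate can be recovered from the proportion of scored intervals. The sketch below shows that general relation only. Whether it matches the exact estimator evaluated in the study is an assumption, and the function name `poisson_corrected_rate` is hypothetical.

```python
import math


def poisson_corrected_rate(scored_intervals, total_intervals, interval_duration):
    """Estimate an event rate from interval data under a Poisson assumption.

    scored_intervals  : number of intervals with at least one event
    total_intervals   : number of intervals observed
    interval_duration : length of each interval, in the desired time unit
    """
    if total_intervals <= 0 or interval_duration <= 0:
        raise ValueError("total_intervals and interval_duration must be positive")
    p = scored_intervals / total_intervals
    if not 0 <= p < 1:
        # With every interval scored, the estimate is unbounded.
        raise ValueError("proportion of scored intervals must be in [0, 1)")
    return -math.log(1 - p) / interval_duration
```

Multiplying the returned rate by the session duration gives a corrected count for the session.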