Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020
Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…
Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement
Heidemanns, Merlin; Gelman, Andrew; Morris, G. Elliott – Grantee Submission, 2020
During modern general election cycles, information to forecast the electoral outcome is plentiful. So-called fundamentals like economic growth provide information early in the cycle. Trial-heat polls become informative closer to Election Day. Our model builds on Linzer (2013) and is implemented in Stan (Stan Development Team, 2020). We improve on the estimation…
Descriptors: Evaluation, Bayesian Statistics, Elections, Presidents
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences
Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019
The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…
Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation
Manna, Venessa F.; Gu, Lixiong – ETS Research Report Series, 2019
When using the Rasch model, equating with a nonequivalent groups anchor test design is commonly achieved by adjustment of new form item difficulty using an additive equating constant. Using simulated 5-year data, this report compares 4 approaches to calculating the equating constants and the subsequent impact on equating results. The 4 approaches…
Descriptors: Item Response Theory, Test Items, Test Construction, Sample Size
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
Pashley, Nicole E.; Miratrix, Luke W. – Grantee Submission, 2019
In the causal inference literature, evaluating blocking from a potential outcomes perspective has two main branches of work. The first focuses on larger blocks, with multiple treatment and control units in each block. The second focuses on matched pairs, with a single treatment and control unit in each block. These literatures not only provide…
Descriptors: Causal Models, Statistical Inference, Research Methodology, Computation
Lee, Selene Sunmin – ProQuest LLC, 2019
Measuring socioeconomic status (SES) is very important in educational research, as researchers often use this information to contextualize the results of an assessment or to control for SES when analyzing the relationship between academic achievement and other variables. However, any cross-country comparisons using SES data from international…
Descriptors: Error of Measurement, Achievement Tests, International Assessment, Foreign Countries
Schürer, Sina; van Ophuysen, Stefanie; Behrmann, Lars – Journal of Psychoeducational Assessment, 2021
Class climate has been a focus of school research for decades. One central facet of class climate is cohesion. Whereas there are well-elaborated instruments to assess cohesion in different contexts (e.g., sport and therapy), such an instrument is missing for the school context. The aim of the current article is to present an…
Descriptors: Elementary School Students, Factor Structure, Classroom Environment, Group Dynamics
Jack, Brady M.; Chen, Chi-Chen; Smith, Thomas J. – Journal of Psychoeducational Assessment, 2021
This investigation (1) elucidates genuine interest in the context of learning from Dewey's perspective, (2) assesses construct validity evidence for a genuine interest conceptual model using data from the Factors Effecting Ethics Learning survey, and (3) assesses measurement invariance of the genuine interest constructs between male (n = 352) and…
Descriptors: Ethics, Educational Philosophy, Error of Measurement, Interpersonal Communication
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Emam, Mahmoud Mohamed; Almehrizi, Rashid; Omara, Ehab; Kazem, Ali Mahdi – International Journal of Developmental Disabilities, 2021
Students at risk for learning disabilities (LD) are overidentified in elementary schools in Oman due to the absence of adequate instruments which teachers can use in validating their observations. Teachers need valid instruments so that their judgment of students' behaviours can help in making academic and non-academic decisions. The Learning…
Descriptors: Screening Tests, Diagnostic Tests, Educational Diagnosis, Test Validity
Chen, Ssu-Kuang; Liu, Yih-Lan; Lin, Sunny S. J. – Educational Psychology, 2022
In research on math self-concept (MSC) formation, very few studies have juxtaposed the effects of math grades, math ability, school-average math grades, and school-average math ability. However, these factors are important in enabling Taiwanese senior school students to achieve proper MSC and choose a suitable academic track. Thus, the present…
Descriptors: Self Concept, Mathematical Aptitude, Mathematics Skills, Gender Differences
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Rubio-Aparicio, María; López-López, José Antonio; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio; Viechtbauer, Wolfgang; Van den Noortgate, Wim – Research Synthesis Methods, 2018
The random-effects model, applied in most meta-analyses nowadays, typically assumes normality of the distribution of the effect parameters. The purpose of this study was to examine the performance of various random-effects methods (standard method, Hartung's method, profile likelihood method, and bootstrapping) for computing an average effect size…
Descriptors: Effect Size, Meta Analysis, Intervals, Monte Carlo Methods