Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Burke, Danielle L.; Ensor, Joie; Snell, Kym I. E.; van der Windt, Danielle; Riley, Richard D. – Research Synthesis Methods, 2018
Percentage study weights in meta-analysis reveal the contribution of each study toward the overall summary results and are especially important when some studies are considered outliers or at high risk of bias. In meta-analyses of test accuracy reviews, such as a bivariate meta-analysis of sensitivity and specificity, the percentage study weights…
Descriptors: Meta Analysis, Research Reports, Statistical Analysis, Sample Size
Sinharay, Sandip – Journal of Educational Measurement, 2018
The value-added method of Haberman is arguably one of the most popular methods to evaluate the quality of subscores. The method is based on the classical test theory and deems a subscore to be of added value if the subscore predicts the corresponding true subscore better than does the total score. Sinharay provided an interpretation of the added…
Descriptors: Scores, Value Added Models, Raw Scores, Item Response Theory
Moses, Tim; Kim, YoungKoung – Journal of Educational Measurement, 2017
The focus of this article is on scale score transformations that can be used to stabilize conditional standard errors of measurement (CSEMs). Three transformations for stabilizing the estimated CSEMs are reviewed, including the traditional arcsine transformation, a recently developed general variance stabilization transformation, and a new method…
Descriptors: Error of Measurement, Scores, Comparative Analysis, Item Response Theory
da Silva, M. A. Salgueiro; Seixas, T. M. – Physics Teacher, 2017
Measuring one physical quantity as a function of another often requires making some choices prior to the measurement process. Two of these choices are: the data range where measurements should focus and the number (n) of data points to acquire in the chosen data range. Here, we consider data range as the interval of variation of the independent…
Descriptors: Physics, Regression (Statistics), Measurement, Measurement Techniques
Kilic, Abdullah Faruk; Uysal, Ibrahim; Atar, Burcu – International Journal of Assessment Tools in Education, 2020
This Monte Carlo simulation study aimed to investigate confirmatory factor analysis (CFA) estimation methods under different conditions, such as sample size, distribution of indicators, test length, average factor loading, and factor structure. Binary data were generated to compare the performance of maximum likelihood (ML), mean and variance…
Descriptors: Factor Analysis, Computation, Methods, Sample Size
Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020
Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…
Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format
Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020
Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…
Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement
Heidemanns, Merlin; Gelman, Andrew; Morris, G. Elliott – Grantee Submission, 2020
During modern general election cycles, information to forecast the electoral outcome is plentiful. So-called fundamentals like economic growth provide information early in the cycle. Trial-heat polls become informative closer to Election Day. Our model builds on (Linzer, 2013) and is implemented in Stan (Team, 2020). We improve on the estimation…
Descriptors: Evaluation, Bayesian Statistics, Elections, Presidents
Schürer, Sina; van Ophuysen, Stefanie; Behrmann, Lars – Journal of Psychoeducational Assessment, 2021
Class climate has been focused in the context of school research for decades. One central facet of class climate is cohesion. Whereas there are well-elaborated instruments to assess cohesion in different contexts (e.g., sport and therapy), such an instrument is missing for the school context. The aim of the current article is to present an…
Descriptors: Elementary School Students, Factor Structure, Classroom Environment, Group Dynamics
Jack, Brady M.; Chen, Chi-Chen; Smith, Thomas J. – Journal of Psychoeducational Assessment, 2021
This investigation (1) elucidates genuine interest in the context of learning from Dewey's perspective, (2) assesses construct validity evidence for a genuine interest conceptual model using data from the Factors Effecting Ethics Learning survey, and (3) assesses measurement invariance of the genuine interest constructs between male (n = 352) and…
Descriptors: Ethics, Educational Philosophy, Error of Measurement, Interpersonal Communication
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Emam, Mahmoud Mohamed; Almehrizi, Rashid; Omara, Ehab; Kazem, Ali Mahdi – International Journal of Developmental Disabilities, 2021
Students at risk for learning disabilities (LD) are overidentified in elementary schools in Oman due to the absence of adequate instruments which teachers can use in validating their observations. Teachers need valid instruments so that their judgment of students' behaviours can help in making academic and non-academic decision. The Learning…
Descriptors: Screening Tests, Diagnostic Tests, Educational Diagnosis, Test Validity
Chen, Ssu-Kuang; Liu, Yih-Lan; Lin, Sunny S. J. – Educational Psychology, 2022
In research on math self-concept (MSC) formation, very few studies have juxtaposed the effects of math grades, math ability, school-average math grades, and school-average math ability. However, these factors are important in enabling Taiwanese senior school students to achieve proper MSC and choose a suitable academic track. Thus, the present…
Descriptors: Self Concept, Mathematical Aptitude, Mathematics Skills, Gender Differences
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Peer reviewed
Direct link
