Jamshidi, Laleh; Declercq, Lies; Fernández-Castilla, Belén; Ferron, John M.; Moeyaert, Mariola; Beretvas, S. Natasha; Van den Noortgate, Wim – Grantee Submission, 2020
The focus of the current study is on handling the dependence among multiple regression coefficients representing the treatment effects when meta-analyzing data from single-case experimental studies. We compare the results when applying three different multilevel meta-analytic models (i.e., a univariate multilevel model avoiding the dependence, a…
Descriptors: Multivariate Analysis, Hierarchical Linear Modeling, Meta Analysis, Regression (Statistics)
Lenz, A. Stephen; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
The factor structure, measurement invariance, and internal consistency of the Patient Health Questionnaire for Depression and Anxiety (PHQ-4) were examined with a rural, predominately Hispanic sample (N = 711). Findings supported use of a one-factor model across gender, age groups, and Spanish-speaking groups. Counseling practice and research…
Descriptors: Psychometrics, Error of Measurement, Patients, Questionnaires
Chang, Heesun – Language Assessment Quarterly, 2022
Drawing on the framework of invariant measurement from Rasch measurement theory, the purpose of this study is to psychometrically evaluate the 20 language and teaching skill domains of the International Teaching Assistant (ITA) Test using the many-facet Rasch model and to empirically explore performance differences between females and males in…
Descriptors: Teaching Assistants, Grammar, Second Language Learning, Second Language Instruction
Paulsen, Justin; Valdivia, Dubravka Svetina – Journal of Experimental Education, 2022
Cognitive diagnostic models (CDMs) are a family of psychometric models designed to provide categorical classifications for multiple latent attributes. CDMs provide more granular evidence than other psychometric models and have potential for guiding teaching and learning decisions in the classroom. However, CDMs have primarily been conducted using…
Descriptors: Psychometrics, Classification, Teaching Methods, Learning Processes
ALKursheh, Taha Okleh; Al-zboon, Habis Saad; AlNasraween, Mo'en Salman – International Journal of Instruction, 2022
This study aimed at comparing the effect of two test item formats (multiple-choice and complete) on estimating person's ability, item parameters, and the test information function (TIF). To achieve the aim of the study, two formats of a mathematics (1) test were created: multiple-choice and complete. In its final format, the test consisted of 31 items. The…
Descriptors: Comparative Analysis, Test Items, Item Response Theory, Test Format
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2022
This paper develops models to measure growth in student achievement with a focus on the possibility of differential growth in achievement for low and high-achieving students. We consider a gap-closing model that evaluates the degree to which students in a target group -- students in the bottom quartile of measured achievement -- perform better…
Descriptors: Academic Achievement, Achievement Gap, Models, Measurement Techniques
Rollins, Derrick, Sr. – Chemical Engineering Education, 2017
Statistical inference simply means to draw a conclusion based on information that comes from data. Error bars are the most commonly used tool for data analysis and inference in chemical engineering data studies. This work demonstrates, using common types of data collection studies, the importance of specifying the statistical model for sound…
Descriptors: Data Analysis, Statistical Inference, Chemical Engineering, Models
Wang, Yan; Kim, Eun Sook; Nguyen, Diep Thi; Pham, Thanh Vinh; Chen, Yi-Hsin; Yi, Zhiyao – AERA Online Paper Repository, 2017
The analysis of variance (ANOVA) F test is a commonly used method to test the mean equality among two or more populations. A critical assumption of ANOVA is homogeneity of variance (HOV), that is, the compared groups have equal variances. Although it is encouraged to test HOV as part of the regular ANOVA procedure, the efficacy of the initial HOV…
Descriptors: Statistical Analysis, Error of Measurement, Robustness (Statistics), Sampling
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, with the use of a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement
Shear, Benjamin R.; Reardon, Sean F. – Stanford Center for Education Policy Analysis, 2019
This paper describes a method for pooling grouped, ordered-categorical data across multiple waves to improve small-sample heteroskedastic ordered probit (HETOP) estimates of latent distributional parameters. We illustrate the method with aggregate proficiency data reporting the number of students in schools or districts scoring in each of a small…
Descriptors: Computation, Scores, Statistical Distributions, Sample Size
Weng, Cathy; Puspitasari, Dani; Tran, Khanh Nguyen Phuong; Feng, Pei Jie; Awuor, Nicholas O.; Matere, Isaac Manyonge – Interactive Learning Environments, 2023
The purpose of this study was to investigate the effect of augmented reality (AR) using a 3D app in a smartphone on students' learning outcomes and satisfaction in teaching angle measurement error to vocational high school students with different spatial ability. A quasi-experimental pretest/posttest was employed. There were 197 students from…
Descriptors: Teaching Methods, Error of Measurement, Multimedia Instruction, Learning Processes
Philipp, Michel; Strobl, Carolin; de la Torre, Jimmy; Zeileis, Achim – Journal of Educational and Behavioral Statistics, 2018
Cognitive diagnosis models (CDMs) are an increasingly popular method to assess mastery or nonmastery of a set of fine-grained abilities in educational or psychological assessments. Several inference techniques are available to quantify the uncertainty of model parameter estimates, to compare different versions of CDMs, or to check model…
Descriptors: Computation, Error of Measurement, Models, Cognitive Measurement
Zhang, Xue; Wang, Chun; Tao, Jian – Grantee Submission, 2018
Testing item-level fit is important in scale development to guide item revision/deletion. Many item-level fit indices have been proposed in literature, yet none of them were directly applicable to an important family of models, namely, the higher order item response theory (HO-IRT) models. In this study, chi-square-based fit indices (i.e., Yen's…
Descriptors: Item Response Theory, Models, Test Items, Goodness of Fit
Hyunsuk Han – ProQuest LLC, 2018
In Huggins-Manley & Han (2017), it was shown that WLSMV global model fit indices used in structural equation modeling practice are sensitive to person parameter estimate RMSE and item difficulty parameter estimate RMSE that results from local dependence in 2-PL IRT models, particularly when conditioning on number of test items and sample size.…
Descriptors: Models, Statistical Analysis, Item Response Theory, Evaluation Methods
Pei-Hsuan Chiu – ProQuest LLC, 2018
Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…
Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models