Publication Date
In 2025 | 0 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 30 |
Since 2006 (last 20 years) | 47 |
Descriptor
Decision Making | 47 |
Error of Measurement | 47 |
Reliability | 12 |
Scores | 10 |
Accuracy | 9 |
Comparative Analysis | 7 |
Item Analysis | 7 |
Item Response Theory | 7 |
Models | 7 |
Curriculum Based Assessment | 6 |
Evaluation Methods | 6 |
More ▼ |
Source
Author
Burns, Matthew K. | 2 |
Emons, Wilco H. M. | 2 |
Sijtsma, Klaas | 2 |
Adrian Adams | 1 |
Alici, Devrim | 1 |
Avi Feller | 1 |
Ayan, Cansu | 1 |
Baker, Scott K. | 1 |
Bichi, Ado Abdu | 1 |
Birnbaum, Michael H. | 1 |
Chen, Ssu-Kuang | 1 |
More ▼ |
Publication Type
Journal Articles | 43 |
Reports - Research | 30 |
Reports - Evaluative | 10 |
Reports - Descriptive | 5 |
Opinion Papers | 2 |
Dissertations/Theses -… | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 9 |
Elementary Education | 8 |
Postsecondary Education | 8 |
Secondary Education | 6 |
Elementary Secondary Education | 3 |
Grade 1 | 3 |
Early Childhood Education | 2 |
Grade 2 | 2 |
Grade 3 | 2 |
Grade 5 | 2 |
High Schools | 2 |
More ▼ |
Audience
Location
California | 1 |
China | 1 |
Georgia | 1 |
Netherlands | 1 |
Oregon | 1 |
Portugal | 1 |
Taiwan | 1 |
Turkey | 1 |
United Kingdom (England) | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Dynamic Indicators of Basic… | 2 |
Center for Epidemiologic… | 1 |
Cognitive Abilities Test | 1 |
Iowa Tests of Basic Skills | 1 |
National Assessment of… | 1 |
Program for International… | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Jiangqiong Li – ProQuest LLC, 2024
When measuring latent constructs, for example, language ability, we use statistical models to specify appropriate relationships between the latent construct and observe responses to test items. These models rely on theoretical assumptions to ensure accurate parameter estimates for valid inferences based on the test results. This dissertation…
Descriptors: Goodness of Fit, Item Response Theory, Models, Measurement Techniques
Shunji Wang; Katerina M. Marcoulides; Jiashan Tang; Ke-Hai Yuan – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A necessary step in applying bi-factor models is to evaluate the need for domain factors with a general factor in place. The conventional null hypothesis testing (NHT) was commonly used for such a purpose. However, the conventional NHT meets challenges when the domain loadings are weak or the sample size is insufficient. This article proposes…
Descriptors: Hypothesis Testing, Error of Measurement, Comparative Analysis, Monte Carlo Methods
Oscar Clivio; Avi Feller; Chris Holmes – Grantee Submission, 2024
Reweighting a distribution to minimize a distance to a target distribution is a powerful and flexible strategy for estimating a wide range of causal effects, but can be challenging in practice because optimal weights typically depend on knowledge of the underlying data generating process. In this paper, we focus on design-based weights, which do…
Descriptors: Evaluation Methods, Causal Models, Error of Measurement, Guidelines
Hitczenko, Marcin – Sociological Methods & Research, 2022
Researchers interested in studying the frequency of events or behaviors among a population must rely on count data provided by sampled individuals. Often, this involves a decision between live event counting, such as a behavioral diary, and recalled aggregate counts. Diaries are generally more accurate, but their greater cost and respondent burden…
Descriptors: Surveys, Social Science Research, Recall (Psychology), Diaries
Guler, Gul; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022
The purpose of this study was to investigate the Type I Error findings and power rates of the methods used to determine dimensionality in unidimensional and bidimensional psychological constructs for various conditions (characteristic of the distribution, sample size, length of the test, and interdimensional correlation) and to examine the joint…
Descriptors: Comparative Analysis, Error of Measurement, Decision Making, Factor Analysis
DeMars, Christine E. – Educational and Psychological Measurement, 2019
Previous work showing that revised parallel analysis can be effective with dichotomous items has used a two-parameter model and normally distributed abilities. In this study, both two- and three-parameter models were used with normally distributed and skewed ability distributions. Relatively minor skew and kurtosis in the underlying ability…
Descriptors: Item Analysis, Models, Error of Measurement, Item Response Theory
Jones, Andrew T.; Kopp, Jason P.; Ong, Thai Q. – Educational Measurement: Issues and Practice, 2020
Studies investigating invariance have often been limited to measurement or prediction invariance. Selection invariance, wherein the use of test scores for classification results in equivalent classification accuracy between groups, has received comparatively little attention in the psychometric literature. Previous research suggests that some form…
Descriptors: Test Construction, Test Bias, Classification, Accuracy
Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024
In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…
Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments
Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2018
Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…
Descriptors: Weighted Scores, Error of Measurement, Test Use, Decision Making
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Paulsen, Justin; Valdivia, Dubravka Svetina – Journal of Experimental Education, 2022
Cognitive diagnostic models (CDMs) are a family of psychometric models designed to provide categorical classifications for multiple latent attributes. CDMs provide more granular evidence than other psychometric models and have potential for guiding teaching and learning decisions in the classroom. However, CDMs have primarily been conducted using…
Descriptors: Psychometrics, Classification, Teaching Methods, Learning Processes
Selvi, Hüseyin; Alici, Devrim; Uzun, Nezaket Bilge – Asian Journal of Education and Training, 2020
This study aims to comparatively examine the resultant findings by testing the measurement invariance with structural equation modeling in cases where the missing data is handled using the expectation-maximization (EM), regression imputation, and mean substitution methods in the complete data matrix and the 5% missing data matrix that is randomly…
Descriptors: Error of Measurement, Structural Equation Models, Attitude Measures, Student Attitudes
Hopster-den Otter, Dorien; Muilenburg, Selia N.; Wools, Saskia; Veldkamp, Bernard P.; Eggen, Theo J. H. M. – Assessment in Education: Principles, Policy & Practice, 2019
This study investigated (1) the extent to which presentations of measurement error in score reports influence teachers' decisions and (2) teachers' preferences in relation to these presentations. Three presentation formats of measurement error (blur, colour value and error bar) were compared to a presentation format that omitted measurement error.…
Descriptors: Error of Measurement, Scores, Decision Making, Teacher Attitudes
Chen, Ssu-Kuang; Liu, Yih-Lan; Lin, Sunny S. J. – Educational Psychology, 2022
In research on math self-concept (MSC) formation, very few studies have juxtaposed the effects of math grades, math ability, school-average math grades, and school-average math ability. However, these factors are important in enabling Taiwanese senior school students to achieve proper MSC and choose a suitable academic track. Thus, the present…
Descriptors: Self Concept, Mathematical Aptitude, Mathematics Skills, Gender Differences
Cikrikci, Nukhet; Yalcin, Seher; Kalender, Ilker; Gul, Emrah; Ayan, Cansu; Uyumaz, Gizem; Sahin-Kursad, Merve; Kamis, Omer – International Journal of Assessment Tools in Education, 2020
This study tested the applicability of the theoretical Examination for Candidates of Driving License (ECODL) in Turkey as a computerized adaptive test (CAT). Firstly, various simulation conditions were tested for the live CAT through an item response theory-based calibrated item bank. The application of the simulated CAT was based on data from…
Descriptors: Motor Vehicles, Traffic Safety, Computer Assisted Testing, Item Response Theory