Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023
The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's χ², and Raju's Areas…
Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias
Silber, Henning; Roßmann, Joss; Gummer, Tobias – Field Methods, 2022
Attention checks detect inattentiveness by instructing respondents to perform a specific task. However, while respondents may correctly process the task, they may choose to not comply with the instructions. We investigated the issue of noncompliance in attention checks in two web surveys. In Study 1, we measured respondents' attitudes toward…
Descriptors: Compliance (Psychology), Attention, Task Analysis, Online Surveys
Zhang, Zhonghua – Journal of Experimental Education, 2022
Reporting standard errors of equating has been advocated as a standard practice when conducting test equating. The two most widely applied procedures for standard errors of equating, the bootstrap method and the delta method, are either computationally intensive or confined to the derivations of complicated formulas. In the current study,…
Descriptors: Error of Measurement, Item Response Theory, True Scores, Equated Scores
Ellison, George T. H. – Journal of Statistics and Data Science Education, 2021
Temporality-driven covariate classification had limited impact on: the specification of directed acyclic graphs (DAGs) by 85 novice analysts (medical undergraduates); or the risk of bias in DAG-informed multivariable models designed to generate causal inference from observational data. Only 71 students (83.5%) managed to complete the…
Descriptors: Statistics Education, Medical Education, Undergraduate Students, Graphs
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, with the use of a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement
Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill
Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…
Descriptors: Evaluators, Training, Comparative Analysis, Academic Language
Gauns Dessai, Kissan G.; Kamat, Venkatesh V. – International Journal of Information and Communication Technology Education, 2018
Educational institutions worldwide conduct summative examinations to evaluate academic performance of students. Such summative examinations are normally subjective in nature in higher education institutions and need manual evaluation. However, the manual evaluation of subjective answer-scripts often suffers from evaluation anomalies and the…
Descriptors: Computer Assisted Testing, Student Evaluation, Scoring Rubrics, Error Patterns
Montoye, Alexander H. K.; Mitrzyk, Joe R.; Molesky, Monroe J. – Measurement in Physical Education and Exercise Science, 2017
The purpose of the current study was to determine the accuracy of the Fitbit Charge HR and Hexoskin smart shirt. Participants (n = 32, age: 23.5 ± 1.3 years) wore a Fitbit and Hexoskin while performing 14 activities in a laboratory and on a track (lying, sitting, standing, walking various speeds and inclines, jogging, and cycling). Steps, kcals,…
Descriptors: Measurement Equipment, Physical Activity Level, Measures (Individuals), Exercise Physiology
Brenner, Philip S. – Sociological Methods & Research, 2017
That rates of normative behaviors produced by sample surveys are higher than actual behavior warrants is well evidenced in the research literature. Less well understood is the source of this error. Twenty-five cognitive interviews were conducted to probe responses to a set of common, conventional survey questions about one such normative behavior:…
Descriptors: Interviews, Surveys, Religion, Religious Factors
Eshach, Haim; Kukliansky, Ida – International Journal of Science and Mathematics Education, 2018
The present study uses the intuitive rules theory as a framework to examine whether some of the difficulties in dealing with errors and uncertainties observed among students in the university physics laboratory can stem from their use of intuitive rules. The study also examines the relationship between the use of intuitive rules and laboratory…
Descriptors: Physics, Engineering Education, Error of Measurement, Error Patterns
Kogar, Esin Yilmaz; Kelecioglu, Hülya – Journal of Education and Learning, 2017
The purpose of this research is to first estimate the item and ability parameters and the standard error values related to those parameters obtained from Unidimensional Item Response Theory (UIRT), bifactor (BIF) and Testlet Response Theory models (TRT) in the tests including testlets, when the number of testlets, number of independent items, and…
Descriptors: Item Response Theory, Models, Mathematics Tests, Test Items
Brown, Molly; Bossé, Michael J.; Chandler, Kayla – International Journal for Mathematics Teaching and Learning, 2016
This study investigates the nature of student errors in the context of problem solving and Dynamic Math Environments. This led to the development of the Problem Solving Action Identification Framework; this framework captures and defines all activities and errors associated with problem solving in a dynamic math environment. Found are three…
Descriptors: Error Patterns, Student Projects, Problem Solving, Mathematics Activities
Deke, John; Wei, Thomas; Kautz, Tim – National Center for Education Evaluation and Regional Assistance, 2017
Evaluators of education interventions are increasingly designing studies to detect impacts much smaller than the 0.20 standard deviations that Cohen (1988) characterized as "small." While the need to detect smaller impacts is based on compelling arguments that such impacts are substantively meaningful, the drive to detect smaller impacts…
Descriptors: Intervention, Educational Research, Research Problems, Statistical Bias
Schoeneberger, Jason A. – Journal of Experimental Education, 2016
The design of research studies utilizing binary multilevel models must necessarily incorporate knowledge of multiple factors, including estimation method, variance component size, or number of predictors, in addition to sample sizes. This Monte Carlo study examined the performance of random effect binary outcome multilevel models under varying…
Descriptors: Sample Size, Models, Computation, Predictor Variables