Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 19 |
Since 2006 (last 20 years) | 49 |
Descriptor
Error Patterns | 78 |
Error of Measurement | 78 |
Statistical Analysis | 18 |
Computation | 12 |
Research Methodology | 12 |
Measurement Techniques | 10 |
Test Reliability | 10 |
Evaluation Methods | 9 |
Higher Education | 9 |
Statistical Bias | 9 |
Correlation | 8 |
Education Level
Higher Education | 17 |
Postsecondary Education | 11 |
Elementary Secondary Education | 6 |
Elementary Education | 1 |
Grade 4 | 1 |
Audience
Practitioners | 2 |
Location
American Samoa | 1 |
California | 1 |
District of Columbia | 1 |
Guam | 1 |
Massachusetts (Boston) | 1 |
Nepal | 1 |
Northern Mariana Islands | 1 |
Oklahoma | 1 |
Puerto Rico | 1 |
Taiwan | 1 |
Texas | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Wechsler Intelligence Scale… | 3 |
National Assessment of… | 2 |
International English… | 1 |
New Jersey College Basic… | 1 |
Wechsler Adult Intelligence… | 1 |
Wide Range Achievement Test | 1 |
Mark White; Matt Ronfeldt – Educational Assessment, 2024
Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…
Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns
Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023
The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's [chi-squared], and Raju's Areas…
Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias
Atehortua, Laura – ProQuest LLC, 2022
Intelligence tests are used in a variety of settings such as schools, clinics, and courts to assess the intellectual capacity of individuals of all ages. Intelligence tests are used to make high-stakes decisions such as special education placement, employment, eligibility for social security services, and determination of the death penalty.…
Descriptors: Adults, Intelligence Tests, Children, Error of Measurement
Warne, Russell T. – Journal of Advanced Academics, 2022
Recently, Picho-Kiroga (2021) published a meta-analysis on the effect of stereotype threat on females. Their conclusion was that the average effect size for stereotype threat studies was d = .28, but that effects are overstated because the majority of studies on stereotype threat in females include methodological characteristics that inflate the…
Descriptors: Sex Stereotypes, Females, Meta Analysis, Effect Size
Silber, Henning; Roßmann, Joss; Gummer, Tobias – Field Methods, 2022
Attention checks detect inattentiveness by instructing respondents to perform a specific task. However, while respondents may correctly process the task, they may choose to not comply with the instructions. We investigated the issue of noncompliance in attention checks in two web surveys. In Study 1, we measured respondents' attitudes toward…
Descriptors: Compliance (Psychology), Attention, Task Analysis, Online Surveys
Zhang, Zhonghua – Journal of Experimental Education, 2022
Reporting standard errors of equating has been advocated as a standard practice when conducting test equating. The two most widely applied procedures for standard errors of equating including the bootstrap method and the delta method are either computationally intensive or confined to the derivations of complicated formulas. In the current study,…
Descriptors: Error of Measurement, Item Response Theory, True Scores, Equated Scores
Ellison, George T. H. – Journal of Statistics and Data Science Education, 2021
Temporality-driven covariate classification had limited impact on: the specification of directed acyclic graphs (DAGs) by 85 novice analysts (medical undergraduates); or the risk of bias in DAG-informed multivariable models designed to generate causal inference from observational data. Only 71 students (83.5%) managed to complete the…
Descriptors: Statistics Education, Medical Education, Undergraduate Students, Graphs
Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019
Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…
Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques
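The distinction drawn in the Yang and Black abstract above can be illustrated with a minimal sketch. It is not taken from the article: the function names and the sample figures (26 successes in 40 trials, p0 = 0.5) are hypothetical, chosen only to show that the two standard errors can lead to different decisions at the two-sided 5% level.

```python
import math

def wald_se(p_hat: float, n: int) -> float:
    """Standard error computed from the sample proportion (Wald interval)."""
    return math.sqrt(p_hat * (1 - p_hat) / n)

def null_se(p0: float, n: int) -> float:
    """Standard error computed from the hypothesized value p0."""
    return math.sqrt(p0 * (1 - p0) / n)

def z_statistics(successes: int, n: int, p0: float) -> tuple[float, float]:
    """Return (z using the sample-based SE, z using the p0-based SE)."""
    p_hat = successes / n
    z_wald = (p_hat - p0) / wald_se(p_hat, n)
    z_null = (p_hat - p0) / null_se(p0, n)
    return z_wald, z_null

# Hypothetical data: 26 successes in 40 trials, testing H0: p = 0.5.
z_wald, z_null = z_statistics(26, 40, 0.5)
critical = 1.96  # two-sided 5% critical value of the standard normal
print(f"sample-based SE: z = {z_wald:.3f}, reject H0: {abs(z_wald) > critical}")
print(f"p0-based SE:     z = {z_null:.3f}, reject H0: {abs(z_null) > critical}")
```

With p0 = 0.5 the p0-based standard error is at its maximum, so the sample-based statistic is always at least as large in magnitude; in this hypothetical case only the sample-based version crosses the critical value.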
Jewsbury, Paul A. – ETS Research Report Series, 2019
When an assessment undergoes changes to the administration or instrument, bridge studies are typically used to try to ensure comparability of scores before and after the change. Among the most common and powerful is the common population linking design, with the use of a linear transformation to link scores to the metric of the original…
Descriptors: Evaluation Research, Scores, Error Patterns, Error of Measurement
Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill
Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…
Descriptors: Evaluators, Training, Comparative Analysis, Academic Language
Gauns Dessai, Kissan G.; Kamat, Venkatesh V. – International Journal of Information and Communication Technology Education, 2018
Educational institutions worldwide conduct summative examinations to evaluate academic performance of students. Such summative examinations are normally subjective in nature in higher education institutions and need manual evaluation. However, the manual evaluation of subjective answer-scripts often suffers from evaluation anomalies and the…
Descriptors: Computer Assisted Testing, Student Evaluation, Scoring Rubrics, Error Patterns
Montoye, Alexander H. K.; Mitrzyk, Joe R.; Molesky, Monroe J. – Measurement in Physical Education and Exercise Science, 2017
The purpose of the current study was to determine the accuracy of the Fitbit Charge HR and Hexoskin smart shirt. Participants (n = 32, age: 23.5 ± 1.3 years) wore a Fitbit and Hexoskin while performing 14 activities in a laboratory and on a track (lying, sitting, standing, walking various speeds and inclines, jogging, and cycling). Steps, kcals,…
Descriptors: Measurement Equipment, Physical Activity Level, Measures (Individuals), Exercise Physiology
Brenner, Philip S. – Sociological Methods & Research, 2017
That rates of normative behaviors produced by sample surveys are higher than actual behavior warrants is well evidenced in the research literature. Less well understood is the source of this error. Twenty-five cognitive interviews were conducted to probe responses to a set of common, conventional survey questions about one such normative behavior:…
Descriptors: Interviews, Surveys, Religion, Religious Factors
Eshach, Haim; Kukliansky, Ida – International Journal of Science and Mathematics Education, 2018
The present study uses the intuitive rules theory as a framework to examine whether some of the difficulties in dealing with errors and uncertainties observed among students in the university physics laboratory can stem from their use of intuitive rules. The study also examines the relationship between the use of intuitive rules and laboratory…
Descriptors: Physics, Engineering Education, Error of Measurement, Error Patterns
Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017
In this essay, we describe the construction and use of the Cut-Score Operating Function in aiding standard setting decisions. The Cut-Score Operating Function shows the relation between the cut-score chosen and the consequent error rate. It allows error rates to be defined by multiple loss functions and will show the behavior of each loss…
Descriptors: Cutting Scores, Standard Setting (Scoring), Decision Making, Error Patterns