ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	8

Descriptor

Error Patterns	8
Error of Measurement	8
Item Response Theory	2
Scoring	2
Undergraduate Students	2
Academic Language	1
Adults	1
Attention	1
Bias	1
Causal Models	1
Children	1
Classification	1
Cognitive Ability	1
Comparative Analysis	1
Compliance (Psychology)	1
Data Analysis	1
Effect Size	1
English (Second Language)	1
Equated Scores	1
Evaluation Problems	1
Evaluators	1
Evidence	1
Examiners	1
Failure	1
Females	1
More ▼

Source

International Journal of…	2
Educational Assessment	1
Field Methods	1
Journal of Advanced Academics	1
Journal of Experimental…	1
Journal of Statistics and…	1
ProQuest LLC	1

Author

Atehortua, Laura	1
Ayse Bilicioglu Gunes	1
Bayram Bicak	1
Ellison, George T. H.	1
Gummer, Tobias	1
Karakaya, Ismail	1
Mark White	1
Matt Ronfeldt	1
Roßmann, Joss	1
Sata, Mehmet	1
Silber, Henning	1
Warne, Russell T.	1
Zhang, Zhonghua	1
More ▼

Publication Type

Journal Articles	7
Reports - Research	6
Dissertations/Theses -…	1
Reports - Evaluative	1

Education Level

Higher Education	3
Postsecondary Education	3

Audience

Location

Turkey

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Monitoring Rater Quality in Observational Systems: Issues Due to Unreliable Estimates of Rater Quality

Peer reviewed

Direct link

Mark White; Matt Ronfeldt – Educational Assessment, 2024

Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…

Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns

Type I Error and Power Rates: A Comparative Analysis of Techniques in Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023

The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's [chi-squared], and Raju's Areas…

Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias

The Effect of Student Examiner Errors on WAIS-IV and WISC-V Composite Scores

Direct link

Atehortua, Laura – ProQuest LLC, 2022

Intelligence tests are used in a variety of settings such as schools, clinics, and courts to assess the intellectual capacity of individuals of all ages. Intelligence tests are used to make high-stakes decisions such as special education placement, employment, eligibility for social security services, and determination of the death penalty.…

Descriptors: Adults, Intelligence Tests, Children, Error of Measurement

No Strong Evidence of Stereotype Threat in Females: A Reassessment of the Meta-Analysis

Peer reviewed

Direct link

Warne, Russell T. – Journal of Advanced Academics, 2022

Recently, Picho-Kiroga (2021) published a meta-analysis on the effect of stereotype threat on females. Their conclusion was that the average effect size for stereotype threat studies was d = .28, but that effects are overstated because the majority of studies on stereotype threat in females include methodological characteristics that inflate the…

Descriptors: Sex Stereotypes, Females, Meta Analysis, Effect Size

The Issue of Noncompliance in Attention Check Questions: False Positives in Instructed Response Items

Peer reviewed

Direct link

Silber, Henning; Roßmann, Joss; Gummer, Tobias – Field Methods, 2022

Attention checks detect inattentiveness by instructing respondents to perform a specific task. However, while respondents may correctly process the task, they may choose to not comply with the instructions. We investigated the issue of noncompliance in attention checks in two web surveys. In Study 1, we measured respondents' attitudes toward…

Descriptors: Compliance (Psychology), Attention, Task Analysis, Online Surveys

Estimating Standard Errors of IRT True Score Equating Coefficients Using Imputed Item Parameters

Peer reviewed

Direct link

Zhang, Zhonghua – Journal of Experimental Education, 2022

Reporting standard errors of equating has been advocated as a standard practice when conducting test equating. The two most widely applied procedures for standard errors of equating including the bootstrap method and the delta method are either computationally intensive or confined to the derivations of complicated formulas. In the current study,…

Descriptors: Error of Measurement, Item Response Theory, True Scores, Equated Scores

Might Temporal Logic Improve the Specification of Directed Acyclic Graphs (DAGs)?

Peer reviewed

Direct link

Ellison, George T. H. – Journal of Statistics and Data Science Education, 2021

Temporality-driven covariate classification had limited impact on: the specification of directed acyclic graphs (DAGs) by 85 novice analysts (medical undergraduates); or the risk of bias in DAG-informed multivariable models designed to generate causal inference from observational data. Only 71 students (83.5%) managed to complete the…

Descriptors: Statistics Education, Medical Education, Undergraduate Students, Graphs

Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill

Peer reviewed
PDF on ERIC

Download full text

Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…

Descriptors: Evaluators, Training, Comparative Analysis, Academic Language