Showing 1 to 15 of 16 results
Peer reviewed
Robinson, Daniel H. – Educational Psychology Review, 2021
In an article published in an open-access journal, Pennebaker et al. (PLoS One, 8(11), e79774, 2013) reported that an innovative computer-based system that included daily online testing resulted in better student performance in other concurrent courses and a reduction in achievement gaps between lower- and upper-middle-class students…
Descriptors: Computer Assisted Testing, Academic Achievement, Student Evaluation, College Students
Peer reviewed
Yang Jiang; Mo Zhang; Jiangang Hao; Paul Deane; Chen Li – Journal of Educational Measurement, 2024
The emergence of sophisticated AI tools such as ChatGPT, coupled with the transition to remote delivery of educational assessments in the COVID-19 era, has led to increasing concerns about academic integrity and test security. Using AI tools, test takers can produce high-quality texts effortlessly and use them to game assessments. It is thus…
Descriptors: Integrity, Artificial Intelligence, Technology Uses in Education, Ethics
Peer reviewed
Esther Ulitzsch; Steffi Pohl; Lale Khorramdel; Ulf Kroehne; Matthias von Davier – Journal of Educational and Behavioral Statistics, 2024
Questionnaires are by far the most common tool for measuring noncognitive constructs in psychology and educational sciences. Response bias may pose an additional source of variation between respondents that threatens the validity of conclusions drawn from questionnaire data. We present a mixture modeling approach that leverages response time data from…
Descriptors: Item Response Theory, Response Style (Tests), Questionnaires, Secondary School Students
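The mixture modeling idea in the abstract above can be illustrated with a minimal sketch (not the authors' model): fit a two-component Gaussian mixture to log response times with EM, treating the faster component as a candidate careless-responding class. The simulated data and all parameter values below are illustrative assumptions.

```python
# Minimal sketch: two-component Gaussian mixture on log response times,
# fit with EM, to separate a fast "careless" class from regular responding.
import numpy as np

rng = np.random.default_rng(0)
# Simulated log response times: 85% regular responders, 15% fast responders
log_rt = np.concatenate([rng.normal(2.5, 0.4, 850),   # regular
                         rng.normal(0.8, 0.3, 150)])  # fast/careless

pi = np.array([0.5, 0.5])      # mixing weights
mu = np.array([1.0, 3.0])      # component means
sigma = np.array([1.0, 1.0])   # component SDs

def normal_pdf(x, m, s):
    return np.exp(-0.5 * ((x - m) / s) ** 2) / (s * np.sqrt(2 * np.pi))

for _ in range(200):
    # E-step: posterior responsibility of each component for each response
    dens = pi * normal_pdf(log_rt[:, None], mu, sigma)
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: update weights, means, and SDs from the responsibilities
    nk = resp.sum(axis=0)
    pi = nk / len(log_rt)
    mu = (resp * log_rt[:, None]).sum(axis=0) / nk
    sigma = np.sqrt((resp * (log_rt[:, None] - mu) ** 2).sum(axis=0) / nk)

fast = np.argmin(mu)  # the faster component is the candidate careless class
print(f"estimated careless share: {pi[fast]:.2f}, mean log-RT: {mu[fast]:.2f}")
```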
Peer reviewed
Uto, Masaki; Okano, Masashi – IEEE Transactions on Learning Technologies, 2021
In automated essay scoring (AES), scores are automatically assigned to essays as an alternative to grading by humans. Traditional AES typically relies on handcrafted features, whereas recent studies have proposed AES models based on deep neural networks to obviate the need for feature engineering. Those AES models generally require training on a…
Descriptors: Essays, Scoring, Writing Evaluation, Item Response Theory
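As a point of contrast with the deep neural models the abstract above describes, here is a minimal sketch of the traditional handcrafted-feature AES baseline; the features, toy corpus, and scores are illustrative assumptions, not the authors' system.

```python
# Minimal sketch of feature-based AES: handcrafted essay features fed to
# linear regression. Real systems train on large human-rated corpora; the
# three-essay "corpus" here only demonstrates the mechanics.
import numpy as np

def handcrafted_features(essay: str) -> np.ndarray:
    words = essay.split()
    n_words = max(len(words), 1)
    return np.array([
        n_words,                                        # essay length
        sum(len(w) for w in words) / n_words,           # mean word length
        len(set(w.lower() for w in words)) / n_words,   # type-token ratio
        essay.count(",") + essay.count(";"),            # clause punctuation
    ])

# Toy (essay, human score) pairs standing in for a rated training corpus
train = [("Short essay.", 1.0),
         ("A longer, more elaborate essay; it develops several ideas.", 3.0),
         ("An extensive, carefully structured argument, with varied vocabulary; "
          "it weighs evidence, anticipates objections, and concludes firmly.", 5.0)]
X = np.array([handcrafted_features(e) for e, _ in train])
y = np.array([s for _, s in train])

# Fit regression weights by least squares (with an intercept column)
Xb = np.hstack([X, np.ones((len(X), 1))])
w, *_ = np.linalg.lstsq(Xb, y, rcond=None)

new = "A moderately detailed essay, offering some support for its claims."
score = np.append(handcrafted_features(new), 1.0) @ w
print(f"predicted score: {score:.2f}")
```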
Peer reviewed
Schack, Edna O.; Dueber, David; Thomas, Jonathan Norris; Fisher, Molly H.; Jong, Cindy – AERA Online Paper Repository, 2019
Scoring of teachers' noticing responses is typically burdened with rater bias and reliance upon interrater consensus. The authors sought to make the scoring process more objective, equitable, and generalizable. The development process began with a description of response characteristics for each professional noticing component disconnected from…
Descriptors: Models, Teacher Evaluation, Observation, Bias
Peer reviewed
PDF on ERIC
Nygren, Thomas; Guath, Mona – International Association for Development of the Information Society, 2018
In this study we investigate the ability of 532 teenagers to determine the credibility of digital news. Using an online test, we assess to what extent teenagers are able to determine the credibility of different sources, evaluate credible and biased uses of evidence, and corroborate information. Many respondents fail to identify the credibility of…
Descriptors: Credibility, Information Sources, Information Literacy, News Reporting
Nixi Wang – ProQuest LLC, 2022
Measurement errors attributable to cultural issues are complex and challenging for educational assessments. We need assessment tests sensitive to the cultural heterogeneity of populations, and psychometric methods appropriate to address fairness and equity concerns. Built on the research of culturally responsive assessment, this dissertation…
Descriptors: Culturally Relevant Education, Testing, Equal Education, Validity
Peer reviewed
He, Tung-hsien – SAGE Open, 2019
This study employed a mixed-design approach and the Many-Facet Rasch Measurement (MFRM) framework to investigate whether rater bias occurred between the onscreen scoring (OSS) mode and the paper-based scoring (PBS) mode. Nine human raters analytically marked scanned scripts and paper scripts using a six-category (i.e., six-criterion) rating…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Essays
Peer reviewed
Yu, Fu Yun; Sung, Shannon – Educational Technology & Society, 2019
This study examined whether different identity revelation conditions result in different online targeting behavior among peer-assessors through a pretest and posttest quasi-experimental research design. Students from six fifth-grade classes (N = 196) participated in online learning tasks where they generated and selected peer-generated questions…
Descriptors: Peer Evaluation, Grade 5, Elementary School Students, Educational Technology
Peer reviewed
Witt, Jessica K.; Brockmole, James R. – Journal of Experimental Psychology: Human Perception and Performance, 2012
Stereotypes, expectations, and emotions influence an observer's ability to detect and categorize objects as guns. In light of recent work in action-perception interactions, however, there is another unexplored factor that may be critical: The action choices available to the perceiver. In five experiments, participants determined whether another…
Descriptors: Weapons, Identification, Stereotypes, Visual Perception
Peer reviewed
Yen, Yung-Chin; Ho, Rong-Guey; Liao, Wen-Wei; Chen, Li-Ju – Educational Technology & Society, 2012
In a test, the score would be closer to the examinee's actual ability if careless mistakes could be corrected. In computerized adaptive testing (CAT), however, changing the answer to one item might make the following items no longer appropriate for estimating the examinee's ability. These inappropriate items in a reviewable CAT might in turn introduce bias in ability…
Descriptors: Foreign Countries, Adaptive Testing, Computer Assisted Testing, Item Response Theory
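The concern in the abstract above follows from how adaptive testing selects items. A minimal sketch under a 2PL model (an assumption; the study's model may differ): estimate ability by maximum likelihood, pick the next item by maximum Fisher information, and observe how a retroactive answer change shifts the estimate, leaving the already-administered items mismatched. Item parameters and responses are illustrative.

```python
# Minimal 2PL CAT sketch: grid-search MLE for ability, then
# maximum-information item selection at the current estimate.
import numpy as np

a = np.array([1.2, 0.8, 1.5, 1.0, 1.3])    # item discriminations
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])  # item difficulties

def p_correct(theta, a, b):
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def mle_theta(a, b, responses, grid=np.linspace(-4, 4, 801)):
    # grid-search maximum likelihood estimate of ability
    p = p_correct(grid[:, None], a, b)
    loglik = (responses * np.log(p) + (1 - responses) * np.log(1 - p)).sum(axis=1)
    return grid[np.argmax(loglik)]

def next_item(theta, a, b, administered):
    # Fisher information of each item at the current ability estimate
    info = a**2 * p_correct(theta, a, b) * (1 - p_correct(theta, a, b))
    info[list(administered)] = -np.inf   # exclude items already given
    return int(np.argmax(info))

responses = np.array([1, 1, 0])          # answers to items 0, 1, 2
theta = mle_theta(a[:3], b[:3], responses)
print("theta:", theta, "-> next item:", next_item(theta, a, b, {0, 1, 2}))

# Reviewing and changing the answer to item 0 moves the estimate, so items
# selected under the old estimate are no longer the most informative ones.
changed = np.array([0, 1, 0])
print("after answer change, theta:", mle_theta(a[:3], b[:3], changed))
```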
Peer reviewed
Schmitt, T. A.; Sass, D. A.; Sullivan, J. R.; Walker, C. M. – International Journal of Testing, 2010
Imposed time limits on computer adaptive tests (CATs) can result in examinees having difficulty completing all items, thus compromising the validity and reliability of ability estimates. In this study, the effects of speededness were explored in a simulated CAT environment by varying examinee response patterns to end-of-test items. Expectedly,…
Descriptors: Monte Carlo Methods, Simulation, Computer Assisted Testing, Adaptive Testing
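A minimal sketch of this kind of Monte Carlo setup (illustrative assumptions, not the study's design): generate 2PL responses for a known ability, replace end-of-test responses with random guesses to mimic speededness, and compare the resulting ability estimates.

```python
# Minimal speededness simulation: rapid guessing on end-of-test items
# contaminates the maximum likelihood ability estimate.
import numpy as np

rng = np.random.default_rng(1)
n_items, theta_true = 30, 0.5
a = rng.uniform(0.8, 1.6, n_items)   # item discriminations
b = rng.normal(0.0, 1.0, n_items)    # item difficulties

# Model-consistent responses for the true ability
p = 1.0 / (1.0 + np.exp(-a * (theta_true - b)))
responses = (rng.random(n_items) < p).astype(float)

# Speeded condition: random guessing on the last 6 items
speeded = responses.copy()
speeded[-6:] = (rng.random(6) < 0.25)

def mle_theta(resp, grid=np.linspace(-4, 4, 801)):
    # grid-search maximum likelihood estimate of ability
    pg = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))
    ll = (resp * np.log(pg) + (1 - resp) * np.log(1 - pg)).sum(axis=1)
    return grid[np.argmax(ll)]

print("theta (normal): ", mle_theta(responses))
print("theta (speeded):", mle_theta(speeded))
```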
Kite, Mary E., Ed. – Society for the Teaching of Psychology, 2012
This book compiles several essays about effective evaluation of teaching. Contents of this publication include: (1) Conducting Research on Student Evaluations of Teaching (William E. Addison and Jeffrey R. Stowell); (2) Choosing an Instrument for Student Evaluation of Instruction (Jared W. Keeley); (3) Formative Teaching Evaluations: Is Student…
Descriptors: Feedback (Response), Student Evaluation of Teacher Performance, Online Courses, Teacher Effectiveness
Peer reviewed
Sax, Linda J.; Gilmartin, Shannon K.; Lee, Jenny J.; Hagedorn, Linda Serra – Community College Journal of Research and Practice, 2008
This study was designed to examine response rates and bias among a sample of community college students who received a district-wide survey by standard mail or e-mail. Findings suggest that predictors of response and types of responses are not appreciably different across paper and online mail-out samples when these samples are "matched" in terms…
Descriptors: College Students, Response Style (Tests), Response Rates (Questionnaires), Community Colleges
Peer reviewed
PDF on ERIC
Attali, Yigal – ETS Research Report Series, 2007
This study examined the construct validity of the "e-rater"® automated essay scoring engine as an alternative to human scoring in the context of TOEFL® essay writing. Analyses were based on a sample of students who repeated the TOEFL within a short time period. Two "e-rater" scores were investigated in this study, the first…
Descriptors: Construct Validity, Computer Assisted Testing, Scoring, English (Second Language)