Showing all 11 results
Peer reviewed
Waterbury, Glenn Thomas; DeMars, Christine E. – Educational Assessment, 2021
Vertical scaling is used to put tests of different difficulty onto a common metric. The Rasch model is often used to perform vertical scaling, despite its strict functional form. Few, if any, studies have examined anchor item choice when using the Rasch model to vertically scale data that do not fit the model. The purpose of this study was to…
Descriptors: Test Items, Equated Scores, Item Response Theory, Scaling
Peer reviewed
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
Peer reviewed
Socha, Alan; DeMars, Christine E. – Applied Psychological Measurement, 2013
The software program DIMTEST can be used to assess the unidimensionality of item scores. The software allows the user to specify a guessing parameter. Using simulated data, the effects of guessing parameter specification for use with the ATFIND procedure for empirically deriving the Assessment Subtest (AT; that is, a subtest composed of items that…
Descriptors: Item Response Theory, Computer Software, Guessing (Tests), Simulation
Peer reviewed
Full text available on ERIC
DeMars, Christine E.; Bashkov, Bozhidar M.; Socha, Alan B. – Research & Practice in Assessment, 2013
Examinee effort can impact the validity of scores on higher education assessments. Many studies of examinee effort have briefly noted gender differences, but gender differences in test-taking effort have not been a primary focus of research. This review of the literature brings together gender-related findings regarding three measures of examinee…
Descriptors: Gender Differences, Scores, Student Motivation, Test Wiseness
Peer reviewed
Wise, Steven L.; DeMars, Christine E. – Educational Assessment, 2010
Educational program assessment studies often use data from low-stakes tests to provide evidence of program quality. The validity of scores from such tests, however, is potentially threatened by examinee noneffort. This study investigated the extent to which one type of noneffort--rapid-guessing behavior--distorted the results from three types of…
Descriptors: Validity, Program Evaluation, Guessing (Tests), Motivation
Peer reviewed
DeMars, Christine E.; Wise, Steven L. – International Journal of Testing, 2010
This investigation examined whether different rates of rapid guessing between groups could lead to detectable levels of differential item functioning (DIF) in situations where the item parameters were the same for both groups. Two simulation studies were designed to explore this possibility. The groups in Study 1 were simulated to reflect…
Descriptors: Guessing (Tests), Test Bias, Motivation, Gender Differences
Peer reviewed
Wise, Steven L.; DeMars, Christine E. – Applied Psychological Measurement, 2009
Attali (2005) recently demonstrated that Cronbach's coefficient [alpha] estimate of reliability for number-right multiple-choice tests will tend to be deflated by speededness, rather than inflated as is commonly believed and taught. Although the methods, findings, and conclusions of Attali (2005) are correct, his article may inadvertently invite a…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Computation
Peer reviewed
DeMars, Christine E. – Educational and Psychological Measurement, 2007
Two software packages commonly used for multidimensional item response theory (IRT) models require the user to input values for the lower asymptotes of the item response functions. One way of selecting these values is to estimate lower asymptotes with a one-dimensional IRT model and use those estimates as fixed values in the multidimensional…
Descriptors: Guessing (Tests), Item Response Theory, Computer Software, Models
Peer reviewed
DeMars, Christine E. – Educational Assessment, 2007
A series of 8 tests was administered to university students over 4 weeks for program assessment purposes. The stakes of these tests were low for students; they received course points based on test completion, not test performance. Tests were administered in a counterbalanced order across 2 administrations. Response time effort, a measure of the…
Descriptors: Reaction Time, Guessing (Tests), Testing Programs, College Students
Peer reviewed
Wise, Steven L.; DeMars, Christine E. – Journal of Educational Measurement, 2006
The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account differing levels of examinee effort. This article…
Descriptors: Guessing (Tests), Psychometrics, Inferences, Reaction Time
Wise, Steven L.; DeMars, Christine E.; Kong, Xiaojing – Online Submission, 2005
The validity of inferences based on achievement test scores is dependent on the amount of effort that examinees put forth while taking the test. With low-stakes tests, for which this problem is particularly prevalent, there is a consequent need for psychometric models that can take into account different levels of examinee effort. This article…
Descriptors: Item Response Theory, Mathematical Models, Measurement Techniques, Reaction Time