Wise, Steven L. – Applied Measurement in Education, 2020
In achievement testing there is typically a practical requirement that the set of items administered should be representative of some target content domain. This is accomplished by establishing test blueprints specifying the content constraints to be followed when selecting the items for a test. Sometimes, however, students give disengaged…
Descriptors: Test Items, Test Content, Achievement Tests, Guessing (Tests)
Domingue, Benjamin W.; Kanopka, Klint; Stenhaug, Ben; Sulik, Michael J.; Beverly, Tanesia; Brinkhuis, Matthieu; Circi, Ruhan; Faul, Jessica; Liao, Dandan; McCandliss, Bruce; Obradovic, Jelena; Piech, Chris; Porter, Tenelle; Soland, James; Weeks, Jon; Wise, Steven L.; Yeatman, Jason – Journal of Educational and Behavioral Statistics, 2022
The speed-accuracy trade-off (SAT) suggests that time constraints reduce response accuracy. Its relevance in observational settings--where response time (RT) may not be constrained but respondent speed may still vary--is unclear. Using 29 data sets containing data from cognitive tasks, we use a flexible method for identification of the SAT (which…
Descriptors: Accuracy, Reaction Time, Task Analysis, College Entrance Examinations
Wise, Steven L.; Soland, James; Dupray, Laurence M. – Journal of Applied Testing Technology, 2021
Technology-Enhanced Items (TEIs) have been purported to be more motivating and engaging to test takers than traditional multiple-choice items. The claim of enhanced engagement, however, has thus far received limited research attention. This study examined the rates of rapid-guessing behavior received by three types of items (multiple-choice,…
Descriptors: Test Items, Guessing (Tests), Multiple Choice Tests, Achievement Tests
Wise, Steven L. – Applied Measurement in Education, 2015
Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…
Descriptors: Achievement Tests, Scores, Validity, Test Items
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Setzer, J. Carl; Wise, Steven L.; van den Heuvel, Jill R.; Ling, Guangming – Applied Measurement in Education, 2013
Assessment results collected under low-stakes testing situations are subject to effects of low examinee effort. The use of computer-based testing allows researchers to develop new ways of measuring examinee effort, particularly using response times. At the item level, responses can be classified as exhibiting either rapid-guessing behavior or…
Descriptors: Testing, Guessing (Tests), Reaction Time, Test Items
DeMars, Christine E.; Wise, Steven L. – International Journal of Testing, 2010
This investigation examined whether different rates of rapid guessing between groups could lead to detectable levels of differential item functioning (DIF) in situations where the item parameters were the same for both groups. Two simulation studies were designed to explore this possibility. The groups in Study 1 were simulated to reflect…
Descriptors: Guessing (Tests), Test Bias, Motivation, Gender Differences
Wise, Steven L.; Pastor, Dena A.; Kong, Xiaojing J. – Applied Measurement in Education, 2009
Previous research has shown that rapid-guessing behavior can degrade the validity of test scores from low-stakes proficiency tests. This study examined, using hierarchical generalized linear modeling, examinee and item characteristics for predicting rapid-guessing behavior. Several item characteristics were found significant; items with more text…
Descriptors: Guessing (Tests), Achievement Tests, Correlation, Test Items
Kingsbury, G. Gage; Wise, Steven L. – Journal of Applied Testing Technology, 2011
Development of adaptive tests used in K-12 settings requires the creation of stable measurement scales to measure the growth of individual students from one grade to the next, and to measure change in groups from one year to the next. Accountability systems like No Child Left Behind require stable measurement scales so that accountability has…
Descriptors: Elementary Secondary Education, Adaptive Testing, Academic Achievement, Measures (Individuals)
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
Wise, Steven L.; Finney, Sara J.; Enders, Craig K.; Freeman, Sharon A.; Severance, Donald D. – Applied Measurement in Education, 1999
Examined whether providing item review on a computerized adaptive test could be used by examinees to inflate their scores. Two studies involving 139 undergraduates suggest that examinees are not highly proficient at discriminating item difficulty. A simulation study showed the usefulness of a strategy identified by G. Kingsbury (1996) as a way to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Higher Education
Wise, Steven L.; Kong, Xiaojing – Applied Measurement in Education, 2005
When low-stakes assessments are administered, the degree to which examinees give their best effort is often unclear, complicating the validity and interpretation of the resulting test scores. This study introduces a new method, based on item response time, for measuring examinee test-taking effort on computer-based test items. This measure, termed…
Descriptors: Psychometrics, Validity, Reaction Time, Test Items
Wise, Steven L.; And Others – 1997
The degree to which item review on a computerized adaptive test (CAT) could be used by examinees to inflate their scores artificially was studied. G. G. Kingsbury (1996) described a strategy in which examinees could use the changes in item difficulty during a CAT to determine which items' answers are incorrect and should be changed during item…
Descriptors: Achievement Gains, Adaptive Testing, College Students, Computer Assisted Testing
Finney, Sara J.; Smith, Russell W.; Wise, Steven L. – 1999
Two operational item pools were used to investigate the performance of stratum computerized adaptive tests (CATs) when items were assigned to strata based on empirical estimates of item difficulty or human judgments of item difficulty. Items from the first data set consisted of 54 5-option multiple choice items from a form of the ACT mathematics…
Descriptors: Adaptive Testing, Classification, Computer Assisted Testing, High School Students
Roos, Linda L.; Wise, Steven L.; Finney, Sara J. – 1998
Previous studies have shown that, when administered a self-adapted test, a few examinees will choose item difficulty levels that are not well-matched to their proficiencies, resulting in high standard errors of proficiency estimation. This study investigated whether the previously observed effects of a self-adapted test--lower anxiety and higher…
Descriptors: Adaptive Testing, College Students, Comparative Analysis, Computer Assisted Testing