Showing all 10 results
Peer reviewed
Wise, Steven L.; Soland, James; Bo, Yuanchao – International Journal of Testing, 2020
Disengaged test taking tends to be most prevalent with low-stakes tests. This has led to questions about the validity of aggregated scores from large-scale international assessments such as PISA and TIMSS, as previous research has found a meaningful correlation between the mean engagement and mean performance of countries. The current study, using…
Descriptors: Foreign Countries, International Assessment, Achievement Tests, Secondary School Students
Peer reviewed
Wise, Steven L. – Education Inquiry, 2019
A decision of whether to move from paper-and-pencil to computer-based tests is based largely on a careful weighing of the potential benefits of a change against its costs, disadvantages, and challenges. This paper briefly discusses the trade-offs involved in making such a transition, and then focuses on a relatively unexplored benefit of…
Descriptors: Computer Assisted Testing, Cheating, Test Wiseness, Scores
Peer reviewed
Wise, Steven L.; Kuhfeld, Megan R.; Soland, James – Applied Measurement in Education, 2019
When we administer educational achievement tests, we want to be confident that the resulting scores validly indicate what the test takers know and can do. However, if the test is perceived as low stakes by the test taker, disengaged test taking sometimes occurs, which poses a serious threat to score validity. When computer-based tests are used,…
Descriptors: Guessing (Tests), Computer Assisted Testing, Achievement Tests, Scores
Wise, Steven L. – 1999
Outside of large-scale testing programs, the computerized adaptive test (CAT) has thus far had only limited impact on measurement practice. In smaller-scale testing contexts, limited data are often available, which precludes the establishment of calibrated item pools for use by traditional (i.e., item response theory (IRT) based) CATs. This paper…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Scores
Peer reviewed
Wise, Steven L.; Kong, Xiaojing – Applied Measurement in Education, 2005
When low-stakes assessments are administered, the degree to which examinees give their best effort is often unclear, complicating the validity and interpretation of the resulting test scores. This study introduces a new method, based on item response time, for measuring examinee test-taking effort on computer-based test items. This measure, termed…
Descriptors: Psychometrics, Validity, Reaction Time, Test Items
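The entry above introduces an effort measure built from item response times. As a rough illustration only (not the authors' implementation), the Python sketch below computes the proportion of items answered with at least a plausible amount of time; the per-item 3-second thresholds, the function name response_time_effort, and the data layout are illustrative assumptions.

```python
# Illustrative sketch of a response-time-based effort index.
# Thresholds, names, and data layout are assumptions, not the
# procedure from the study cited above.
from typing import Sequence

def response_time_effort(response_times: Sequence[float],
                         thresholds: Sequence[float]) -> float:
    """Proportion of items answered with 'solution behavior',
    i.e., a response time at or above the item's threshold."""
    if len(response_times) != len(thresholds):
        raise ValueError("one response time and one threshold per item")
    solution_flags = [rt >= th for rt, th in zip(response_times, thresholds)]
    return sum(solution_flags) / len(solution_flags)

# Example: an examinee who rushes through 2 of 5 items
rts = [12.4, 1.1, 9.8, 0.9, 15.0]      # seconds per item
ths = [3.0, 3.0, 3.0, 3.0, 3.0]        # assumed rapid-response thresholds
print(response_time_effort(rts, ths))  # 0.6
```

In practice, thresholds would likely be set item by item from observed response-time behavior rather than fixed at a single constant.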
Wise, Steven L.; And Others – 1997
The degree to which item review on a computerized adaptive test (CAT) could be used by examinees to inflate their scores artificially was studied. G. G. Kingsbury (1996) described a strategy in which examinees could use the changes in item difficulty during a CAT to determine which items' answers are incorrect and should be changed during item…
Descriptors: Achievement Gains, Adaptive Testing, College Students, Computer Assisted Testing
Peer reviewed
Wise, Steven L. – Applied Measurement in Education, 2006
In low-stakes testing, the motivation levels of examinees are often a matter of concern to test givers because a lack of examinee effort represents a direct threat to the validity of the test data. This study investigated the use of response time to assess the amount of examinee effort received by individual test items. In 2 studies, it was found…
Descriptors: Computer Assisted Testing, Motivation, Test Validity, Item Response Theory
Wise, Steven L.; Owens, Kara M.; Yang, Sheng-Ta; Weiss, Brandi; Kissel, Hilary L.; Kong, Xiaojing; Horst, Sonia J. – Online Submission, 2005
There are a variety of situations in which low-stakes achievement tests--which are defined as those having few or no consequences for examinee performance--are used in applied measurement. A problem inherent in such testing is that we often cannot assume that all examinees give their best effort to their test, which suggests that the test scores…
Descriptors: Psychology, Mathematics Tests, Reaction Time, Achievement Tests
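The abstract above is truncated, so the study's own procedure is not reproduced here. As a loosely related illustration, one common use of effort measures of this kind is to screen out low-effort records before interpreting aggregate scores. The sketch below shows that general idea only; the 0.90 cutoff, record layout, and effort values are assumptions, not details from the paper.

```python
# Illustrative sketch: screening out low-effort examinees before
# aggregating scores. Cutoff and data are assumed, not from the study.
from statistics import mean

records = [
    {"id": "A", "score": 242, "effort": 0.97},
    {"id": "B", "score": 188, "effort": 0.55},  # mostly rapid responses
    {"id": "C", "score": 231, "effort": 0.92},
]

EFFORT_CUTOFF = 0.90  # assumed cutoff on an effort index such as the one above

motivated = [r for r in records if r["effort"] >= EFFORT_CUTOFF]
print(mean(r["score"] for r in records))    # unfiltered mean
print(mean(r["score"] for r in motivated))  # mean after screening low effort
```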
Wise, Steven L. – 1996
In recent years, a controversy has arisen about the advisability of allowing examinees to review their test items and possibly change answers. Arguments for and against allowing item review are discussed, and issues that a test designer should consider when designing a Computerized Adaptive Test (CAT) are identified. Most CATs do not allow…
Descriptors: Achievement Gains, Adaptive Testing, Computer Assisted Testing, Error Correction
Peer reviewed
Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level