Showing all 11 results
Peer reviewed
Wise, Steven L. – Applied Measurement in Education, 2020
In achievement testing there is typically a practical requirement that the set of items administered should be representative of some target content domain. This is accomplished by establishing test blueprints specifying the content constraints to be followed when selecting the items for a test. Sometimes, however, students give disengaged…
Descriptors: Test Items, Test Content, Achievement Tests, Guessing (Tests)
Peer reviewed
DeMars, Christine E.; Wise, Steven L. – International Journal of Testing, 2010
This investigation examined whether different rates of rapid guessing between groups could lead to detectable levels of differential item functioning (DIF) in situations where the item parameters were the same for both groups. Two simulation studies were designed to explore this possibility. The groups in Study 1 were simulated to reflect…
Descriptors: Guessing (Tests), Test Bias, Motivation, Gender Differences
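As a rough illustration of the mechanism this abstract describes (not the authors' actual simulation design), the Python sketch below generates responses to a single 3PL item with identical parameters for two groups, but lets part of the focal group rapid-guess at chance; the resulting gap in observed performance reflects disengagement rather than a true parameter difference. All parameter values are invented for the example.

    import numpy as np

    rng = np.random.default_rng(0)

    def p_3pl(theta, a, b, c):
        # Three-parameter logistic model: probability of a correct response
        return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

    n = 2000
    theta = rng.normal(0.0, 1.0, size=n)      # same ability distribution in both groups
    a, b, c = 1.2, 0.0, 0.20                  # identical item parameters for both groups

    # Reference group: everyone responds with effort
    resp_ref = rng.random(n) < p_3pl(theta, a, b, c)

    # Focal group: 15% of examinees rapid-guess on this item (respond at chance)
    rapid = rng.random(n) < 0.15
    p_focal = np.where(rapid, 0.20, p_3pl(theta, a, b, c))
    resp_focal = rng.random(n) < p_focal

    # The focal group's lower proportion correct arises from rapid guessing,
    # not from any difference in the item's parameters between groups.
    print("reference proportion correct:", resp_ref.mean())
    print("focal proportion correct:    ", resp_focal.mean())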
Peer reviewed
Kingsbury, G. Gage; Wise, Steven L. – Journal of Applied Testing Technology, 2011
Development of adaptive tests used in K-12 settings requires the creation of stable measurement scales to measure the growth of individual students from one grade to the next, and to measure change in groups from one year to the next. Accountability systems like No Child Left Behind require stable measurement scales so that accountability has…
Descriptors: Elementary Secondary Education, Adaptive Testing, Academic Achievement, Measures (Individuals)
Peer reviewed
Wise, Steven L.; Finney, Sara J.; Enders, Craig K.; Freeman, Sharon A.; Severance, Donald D. – Applied Measurement in Education, 1999
Examined whether item review on a computerized adaptive test could be used by examinees to inflate their scores. Two studies involving 139 undergraduates suggest that examinees are not highly proficient at discriminating item difficulty. A simulation study showed the usefulness of a strategy identified by G. Kingsbury (1996) as a way to…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Higher Education
Peer reviewed
Barnes, Laura L. B.; Wise, Steven L. – Applied Measurement in Education, 1991
One-parameter and three-parameter item response theory (IRT) model estimates were compared with estimates obtained from two modified one-parameter models that incorporated a constant nonzero guessing parameter. Using small-sample simulation data (50, 100, and 200 simulated examinees), modified one-parameter models were most effective in estimating…
Descriptors: Ability, Achievement Tests, Comparative Analysis, Computer Simulation
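For readers unfamiliar with the models compared in this entry, a minimal sketch (Python; the difficulty and guessing values are arbitrary) contrasts the standard one-parameter (Rasch) item response function with a modified one-parameter function that adds a constant nonzero guessing parameter, in the spirit of the modified models mentioned above.

    import math

    def p_1pl(theta, b):
        # Standard one-parameter (Rasch) model: no guessing floor
        return 1.0 / (1.0 + math.exp(-(theta - b)))

    def p_1pl_guessing(theta, b, c=0.20):
        # Modified one-parameter model: a constant nonzero guessing
        # parameter c raises the lower asymptote from 0 to c.
        return c + (1.0 - c) * p_1pl(theta, b)

    for theta in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(theta, round(p_1pl(theta, 0.0), 3), round(p_1pl_guessing(theta, 0.0), 3))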
Wise, Steven L.; And Others – 1997
The degree to which item review on a computerized adaptive test (CAT) could be used by examinees to inflate their scores artificially was studied. G. G. Kingsbury (1996) described a strategy in which examinees could use the changes in item difficulty during a CAT to determine which items' answers are incorrect and should be changed during item…
Descriptors: Achievement Gains, Adaptive Testing, College Students, Computer Assisted Testing
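The Kingsbury (1996) strategy summarized in this entry rests on a simple inference: in an adaptive test, a drop in the difficulty of the next item suggests the previous answer was scored incorrect. A minimal sketch of that inference (Python; the difficulty sequence is invented for illustration) follows.

    # Hypothetical sequence of item difficulties presented during a CAT.
    difficulties = [0.0, 0.8, 0.3, 1.0, 1.6, 1.1]

    # Under the strategy, item k is flagged as probably incorrect whenever the
    # next item is easier; flagged answers would be changed during item review.
    flagged_for_change = [
        k for k in range(len(difficulties) - 1)
        if difficulties[k + 1] < difficulties[k]
    ]
    print("items to reconsider during review:", flagged_for_change)  # [1, 4]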
Roos, Linda L.; Wise, Steven L.; Finney, Sara J. – 1998
Previous studies have shown that, when administered a self-adapted test, a few examinees will choose item difficulty levels that are not well-matched to their proficiencies, resulting in high standard errors of proficiency estimation. This study investigated whether the previously observed effects of a self-adapted test--lower anxiety and higher…
Descriptors: Adaptive Testing, College Students, Comparative Analysis, Computer Assisted Testing
Wise, Steven L.; And Others – 1991
According to item response theory (IRT), examinee ability estimation is independent of the particular set of test items administered from a calibrated pool. Although the most popular application of this feature of IRT is computerized adaptive (CA) testing, a recently proposed alternative is self-adapted (SA) testing, in which examinees choose the…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing
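The invariance property this abstract invokes (ability estimates that do not depend on which calibrated items happen to be administered) can be illustrated with a small maximum-likelihood sketch, assuming a Rasch model and an invented item pool: different subsets of the same calibrated pool yield similar estimates of one examinee's ability, apart from sampling error.

    import numpy as np

    rng = np.random.default_rng(1)

    def p_rasch(theta, b):
        # Rasch (one-parameter) probability of a correct response
        return 1.0 / (1.0 + np.exp(-(theta - b)))

    def mle_theta(responses, b):
        # Grid-search maximum-likelihood ability estimate under the Rasch model
        grid = np.linspace(-4, 4, 801)
        p = p_rasch(grid[:, None], b[None, :])
        loglik = (responses * np.log(p) + (1 - responses) * np.log(1 - p)).sum(axis=1)
        return grid[np.argmax(loglik)]

    true_theta = 0.5
    pool_b = rng.uniform(-2, 2, size=60)               # calibrated item pool

    for trial in range(3):
        items = rng.choice(60, size=20, replace=False) # a different item subset each time
        b = pool_b[items]
        responses = (rng.random(20) < p_rasch(true_theta, b)).astype(float)
        print("subset", trial, "theta estimate:", round(float(mle_theta(responses, b)), 2))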
PDF pending restoration
Wise, Steven L.; And Others – 1993
A new testing strategy was introduced and evaluated that protects against the problem of examinees in adaptive testing choosing difficulty levels that are not matched to their proficiency levels. The method, termed restricted self-adapted testing (RSAT), still provides examinees with a degree of control over the difficulty levels…
Descriptors: Achievement Tests, Adaptive Testing, Comparative Testing, Computer Assisted Testing
Plake, Barbara S.; Wise, Steven L. – 1986
One question regarding the utility of adaptive testing is the effect of individualized item arrangements on examinee test scores. The purpose of this study was to analyze the item difficulty choices by examinees as a function of previous item performance. The examination was a 25-item test of basic algebra skills given to 36 students in an…
Descriptors: Adaptive Testing, Algebra, College Students, Computer Assisted Testing
Peer reviewed
Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level